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UNIT 1: MATHEMATICAL BASI CS OF 
DIFFERENTIATION AND INTEGRATION 
Introduction 


The technique for converting a real world problem into a 
problem expressed in concrete or mathematical terms is called 
quantitative techniques. The converted mathematical problem is 
called a mathematical model of the real world problem. The real 
world problem 1s in structured and complicated. We make certain 
assumptions regarding the real world problem. Based on the 
assumptions, mathematical model is constructed and solved using 
available mathematical technique. If the outcome arrived at using 
mathematical model do not differ significantly from the actual 
outcome, the model gets validated and approved. 

Unit Objectives 
The objectives of this unit are 

To understand constant and variables 

To understand basics of differentiation 

To understand basics of integration 

To calculate simple interest and compound interest 

To apply derivatives in social and economic problems. 
Linear function and Non-Linear function: 
Linear function 

If the degree of polynomial involved in the polynomial 
function is one then it is called a linear function. 
Examples: 

(1) f(x)=2x-4 


(ii) g(x)=3x-2 are linear functions 


Non-Linear function l l l 
If the polynomial involved in polynomial function 1s of 


degree more than 1, then it is called a non-linear function. 
In particulars, 
(i) If the polynomial involved in polynomial function is of 
degree 2 then it is called a quadratic function. 
Example: (i) f(x)=3x°+2x+1 
(ii) g(x)=2x7-2x+1 
(ii) If the polynomial involved in polynomial function is of 
degree 3 then it is called a cubic function. 
Example: (1) f(x) —Xx ^x^ -x41 
(ii) gCO-x -x-1 
Constants and variables 
1. A quantity which remains same in its value is called 
constant. If the value of the constant 1s a real number, then 
it 1s called is a real number, then it is called real constant. 
2. A quantity which changes in its value is called a variable. 
If a variable takes the values which are real numbers only, 
then the variable 1s called a real variable. 


The variables are usually denoted by u, v, x, y, z, etc. 


Graphical representation of functions 
Let A and B be two non empty sets. If to each element of 


x B, there corresponds a unique elements y B, by means of a 
rule denoted by ‘f then we say that a function or mapping “f” is 


defined from the set A into B. 





Domam Co-domain 


Definition (Real values function 

If to each value of a real variable x a unique real number y 
is associated, by means of a rule ‘f, then we say the variable y is 
a real valued function of the real variable x. This we denote it by 
y—f(x). 

The variable x is called the independent variable and the 
variable y is called the dependent variable. 
Notion of Mathematical Modes 
Definition 

Let f(x) and g(x) be two real valued function of real 
variable x, defined on the same domain. Then the function frg, f- 
g, f.g, f/g and k.f where k is the real constants are defined as 
below. 
(fg) (x) = f(x) + ga) 
(f-g) CO = f(x) — gGO 
(f.g) GO = fGO.gGo 
(5) (x) — 199 where g(x) + O 


BEX} 


Ck. q(x) = k. f(x) 


cf PP 


Illustration 
If f(x) =x? and g(x) = 3x-1 find the functions. (f+2)CXx), 


(f-g)(x). (f.g) (x), E) Cx) and (6£)(x). 


Solution 
Given that, f(x)—x^/, g(x)=3x-1 
a) (fg) (x) = f(x) + go) 
= x^-3x-1 
b) (£-g) Go) = f(x) — g6) 
— x^-(3x-1) 
c) ŒD = fC).g(x) 
— x^(3x-1) 


3 2 
= 3x -x 


d Qc - fea 


G(X) 





X 


^— 8x—1 


i 
Here note that, when x = = 


g(x)—O 
e" (x) is not defined for x — = 

(ie) (6f)(x) = 6f(x) = 6x” 
Illustration 

If f(x)-2x?-4 find £(1), £(2), f1) 
Solution 

Given that f(x)=2x?-4 

a) f(1)=2x17-4 
= 2-4-—-25 


9 £G)-2G) -4 


4 


= 2x — 4 =I 4 
4 x z 





2 
2 
c) f(-1)-2x(-1)^-4 
— 2x1-4—-2 
Types of function 
1. Constant function 
A function y=f(x} is said to be constant function, if for 
every value of ‘x’ the value of f(x) of the function 1s a 
same constant say k. 
Ge) y=f(x)=k for all real values of x. 
Example 
f(x)—2 is a constant function 
f(0)—3, f(-1)22, f (3) — 2 etc., 
2. Identity function 
The function f(x)—x, for all real values of x is called the 
identity function defined on the set of real numbers. 
In this case f(1)—1, f(x)—2, f(-5)—-5 etc., 
3. Polynomial function 
A function of the form f£ (x) = agx” + a,x" 14...-+a, 
where n is a positive integer, 0&$4,,,05,...0, are rational 
constants, is called a polynomial function. 
The right hand side expression is called a polynomial in 
the variable ‘x’ of degree n. 
The following are few polynomial functions 
(i) £(x)=3x*-4x4+2 (ii) g(x)=3x-4 


4. Exponential function 
The function defined by y=e*, for all real number x is 


called the exponential function with base. 


5. Logarithmic function 
The function defined by y=log e" for all x>0, is called 


natural logarithm function of simply logarithm function. 
Simple interest and compound interest 


Simple Interest 
Interest is the payment of extra money made by a 


borrower for having used the money lent by others for certain 
period of time. If a borrows Rs. 10,000 from B for one year, A 
has to pay back Rs. 10,000 plus some extra amount as a 
consideration for having used B’s money. The extra payment 
made 1s known as Interest. 

The money actually borrowed is called principal and the 
total sum comprising principal and interest is called the amount. 

Simple interest is the interest always calculated on the 
principal. It is assumed that simple interest is paid at the end of a 
specified period regularly. 

The formula for caiculating simple interest is Pni. 

‘p’ stands for principal 

‘n’ stands for number of years (period) 

1 stands for rate of interest per unit 
By using the above principle, the following formula can be 
obtained. 

a) Total amount A-—P(1-ni) 

b) Simple interest (SI) — Pni 


c) Principal (P) — = 

d) Period (P) = = 

e) Rate of interest per unit zi = Eid 
pn 


Illustration 


Find simple interest on 


(1) Rs. 1600 for one year at 10% p.a. 
(11) Rs. 2500 for 15 months at 15% p.a. 
Solution 


G) Given that p=Rs. 1600, n=1, i=10% = 0.10 
Simple interest SI— Pni 
= 1600 x I x 0.10 
= Rs. 160 
(11) Given that P=Rs. 2500 
n=— years 
12 
i = 15% = 0.15 per unit 
Simple interest = Pni 
= 2500x 72 x0.15 


= Rs. 468.75 
Hilustration 


At what rate percent per annum will 


a) The simple interest of Rs. 75 for 9 months is Rs. 4.50? 


b) Simple interest on Rs. 25 for 3 years is Rs. 4.50? 
Solution 
a) Given that SI-Rs. 4.50, P=Rs. 75; n = 2 
Rate of interest per unit £ = = 


4.50 
[2 
PIX 


a 
Lr d 





4.50 
$6.25 


= 0.08 = 8% 





Rate of interest = 8% 


b) Given that P=Rs. 25, SI-Rs. 4.50; n=3 years 
4.5 


Rate on interest per unit = —— — parr 75 


Rate of interest = 6% 
illustration 
Find by using appropriate formulae, 
a) The amount of Rs. 700 invested for 272 years at 614% 


p.a. 
b) Time in which Rs. 770 will amount to Rs. 847 at 5% 


p.a. 
c) The sum which yield simple interest of Rs. 77 in 8 
years at 3⁄2% p.a. 
Solution 
a) Given that, P=Rs. 700; n=2.5 years 1—0.0625 
Total amount A = P (1+ni) = 700 (142.5 x 0.0625) 
= 700 (1+0.15625) 


= Rs. 809.38 
b) Give that P=Rs. 770, SI=A-P=847-770=Rs. 77 
1= 0.05 
Time n = = = —— =“ = 2 years 
Thi 7x11. 05 RRS 
c) SI = Rs. 77, n=8 years, 1—0.035 
Principal sum P = E = ——_ = —"_ = Rs.275 
TLE 3 x 0.035 n.028 


Compound Interest 

Compound interest is the interest calculated on the 
principal and accrued interest. The interest unpaid is added to the 
principal and for the next period of interest is computed. In other 


words, compound interest on the growing principal. Both 


3 


principal 


and compound interest change from time to time. 


Compound interest can be computed annual, half yearly, 


quarterly or monthly. 


The following formulae are kept in mind solving the 


problems concerning to compound interest 


1) 
2) 


3) 
4) 


CI=P(1+1)"-P 
A=+P(1+i)" 
P=A-+(1+i)” 
1i." o, 
A — P(a +=) if CI is paid half yearly and ‘n’ 


indicates the number of *half years' in the period. 


5) A = (1 + i)” (a + D etc. if CI is paid annually but 
for each period o time an different rate of interest is 
paid. 

Illustration 


Find the compound interest in following cases 


a) 
b) 


c) 


d) 


Solution 


On Rs. 10,000 for 2 years at 10% p.a. paid annually 

On Rs. 6,000 for 2% years at 10% p.a. paid 

On Rs. 5,000 for 1% years at 5% p.a., CI paid half 
yearly 

Rs. 1,000 for one year at 10% p.a., Cl is paid quarterly 


Give p=Rs. 10,000, n=2 years, 1—0.10 
Amount A = P(1+4 D” 


= 10000 (1+0.10)” 
10000 (1.1)? 
— 12, 00 


Compound interest CI — A-P 


— 12100-10000 
— Rs. 2100 
9 


b) P=Rs. 6000, n=2% years; i=0.10, = = 0.05 
ee | 
A= P(1+i)?(1 +5) 
= 6000 (1--0.10)? (1+0.05)' 
= 6000 x (1.10% x 1.05 
= Rs. 7,623 
c) P=Rs. 5000, n=1 1⁄2 years = 3 half years, 1—0.05 


* — 0.025 
= 


A=P(1+4) 
= 5000 (1+0.025)° 
= 5000 (1.025) 
= 5000 x 1.0769 
= Rs. 5384.45 
CI = A-P = 5384.45 — 5000 
— Rs. 384.45 
d) P—Rs. 1,000; n-1 years or 4 quarters; 1—0.10 


= = 0.025 quarteriy 


A=P (1 + D» 
= 1000 x (1+0.025)* 
= 1000 x (1.025)* 
= Rs. 1103.83 
CI =A-P 
= 1,103.83-1,000 
= Rs. 103.83 


Illustration 
Find the amount if the principal is Rs. 10,000 and the rate 


of interest is 15% p.a. for 2 years if (a) CI is paid annually (b) CI 


is paid half yearly. 
10 


Solution 
Given that P=Rs. 10,000, i=0.15 per unit, n=2 years 
a) If CI 1s paid annually, 
A=P (i+i)” = 10,000 (140.15)? 
= 10,000 (1.15) 
= Rs. 13.225 
b) If CI is paid half yearly 


EER 
A=P(1+ =) = 10000(1 + 0.075)^* 


= 1000 (1.075)* 
— Rs. 13,354.69 


BASIC CONCEPT OF DIFFERENTIATION 
Differential coefficient 

Consider the function y=f(x). Here x is the independent 
variable and y is the dependent variable. The value y of the 
function f(x) changes as the variable x changes. The rate at which 
the function changes, is of great importance in the study of the 
subject *calculus". The process of finding the rate of change of 
the function is called differentiation. 
Differential coefficient of a certain basic functions 


1. Differential coefficient of a constant function is O 


(ie) Gy co 


2. Differential coefficient of the function y=x" where n is any 
rational number, 


d i 
If y=x" then — = nx" i 


3. Differential coefficient of the exponential function 


x dy —— a 
If ye then 4,-— e 


11 


4. The differential coefficient of the product of two functions 


is given by 


If y—uv t en. ta vm 


5. The differential coefficient of the quotient of two function 


is given by 


Illustration 


Differentiate the following w.r.t.x 


i l 
a) x’ b) x^^ C) zu C) i573 
Solution 
a) Let y=x’ 
Differentiate with respect to x, 

e T 

PF a 7x^ —t 

cix 

= Fx" 


n 
b) Let y = x2 


Diff. w.r.t.x 
P 
a ec eee 
x 3 
4 i 
== — Xa 
a 
1 —10 
c) Let y = s = x 
Diff.w.r.t.x 
ay —10— m 
— = —10 191. —10x i 
N s 


12 





d) Let y = -= — -5/2 


Diff. w.r.t.x, 


Illustration 
Differentiate the following w.r.t.x 


a) x? — 3x^ + 4x 4+ 3 b) x* + 3logx — 4e* 


Solution 
a) Let y=x°-3x7+4x4+3 
Diff.w.r.t.x, 

— = 3x? — 6x + 4 


b) Let y=x°+3logx-4e* 
Diff.wr.t.x, 


a 3 
T? = 8x*--— 4e* 
ax x 


Illustration 


Differentiate the following w.r.t.x 


i—x? 





a) (x? — 3x + 2)e* b) 3 
Solution 

a) Let y=(x°-3x+2)e” 

Diff. w.r.t.x 


y 3 —— aáa x x di 3 — 
dx ^7 X 3x + 2)—(e }+e OU 3x + 2) 
— (x? — 3x + 2)e” + e” (3x? — 3) 


— (x3 + 3x* — 3x — 1)e* 


13 


1—x*) 
b) Let y = m. 


Diff.w.r.t.x 
dy (+x) =a — x?) — (1 — x*) (a + x*)) 
dx (1 +x?) 
oo tr? ezana a2 
Ct+x"7* 
—2x— —2x3-224+2x" 


(i+.2*)* 


— 4x 
(Ix? s 


2.2. Higher order derivatives 
If a function is differentiated more than one time is called 


higher order derivative. 


d? 


The second order derivative is denoted by rx] 
Illustration 
Find the second order derivative x^-3x^43x--7 
Let y—x^-3x 43x47 
Diff. w.r.t.x 








SY - 3x? — 6x + 8 
cx 

Again, diff. w.r.t.x 
d?y E p 


Illustration 
Find the second derivative of 2x^-3x^-36x--10 


Let y=2x°-3x*-36x+10 





Diff. w.r.t.x 
TY = 6x? — 6x — 36 
cix 
Again diff.w.r.tx, 
cet? 
a = 12x — 6 


14 


Integration 
Definition of a integral 

The integral of a function f(x) with respect to x is that 
function whose derivative with respect to x in f(x) and is written 
as f(x) dx. 

Generally f(x)dx= (x)-c, since c being an arbitrary 
constant can take any value, f(x)dx is called the indefinite 
integral. 

In this is the sign of integration f(x) is called the inegrand 
and x is called the variable of integration. The process of findings 
the integral is called integration standard forms. 


1. x"gdx = — 7 + c for all values of n except n—-1 


cix 

2 .clegx +e 
3. f e*dx =e* +c 
4. f e?*dx = *— 
5 


7t 


aV 





AL PEN 
. fa dx = ra 


Illustration 


Evaluate f x?^dx 





Solution 
2 E ac) Ft 
f x*^dx = zu ee 
EN 
Illustration 


Evaluate f Cu — 3x5 + =) dx 
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Solution 
3 = 1 
J (x — 3x + =) ax 


1 
== (x? — 3x2+ x?) ax 








| 
| 
D 


Definition 
Let f f(x)dx = F(x)-rc where c is the constant of 


integration. 
The value of the integral when x=b is F(b)+c. The value of 


integral when x-a is F (a) +c. subtracting 

F (b)-F (a) = value of f f(x)dx when x=b-value of 
f fCx)dx when x=a 

The notation J. ° f(X) is used to denote the value of 
integral when x=b the value of the integral when x=a. 

D f (x) dx is called a definite integral. 

Its value is F (b)-F (a) where F(x) is the integral of f(x) 
with respect to x. ‘a’ and ‘b’ are called the limits of integration, 
‘a’ being the lower limit and b being the upper limit. 

Í E f (x) dx is to be read as integral from a and b of f(x) dx 
the integral a to b is called the range of integration. 

Illustration 


Evaluate S + x? + x)Jdx 
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Solution 
J, G3 +x? + x)dx = (z^ i a 2) 
& a8 
ui cx Rs ae © We E 
= ($+ 24 PE 


I (2922024 2932024) — (Gus +4+6 


_ 1304 i3 
^ — 12 i2 
_ 104— 13 
B £2 

9t 


p 
Evaluate i (x? — 3x2 + = ax 


L— a? 2X2 1p 
a Ag 
- B-2-3-E-2- 3] 


— “= — 4/2 
APPLICATION OF THE DERIVATIVE 
The geometric meaning of (=) is the slope of the tanget 
to the curve y=f(x) at the point (x,y). 
(=) is also called the rate of change of y with respect to 


x.The cost of C comprised to two component — fixed cost and the 


variable cost. It is a function of output x. (x is the number of units 


of output) 
TC=FC+VC 
C-—f(x) 
Average cost (AC) — mà ~ 
Marginal cost (MC = T 
SO) 


Marginal average cost (MAC) = mum 
17 


Example 

Find the average cost and the marginal cost functions from 
the total cost function. 

C = 60 + 10x + 15x^ 


f Total cost 
Average cost = —, jut 


60--10x--152x^ 


X 


— 4 10 + 15x 


Marginal cost (MC) = Č = 0 + 10(1) + 15(2x) 


= 10430x 
Revenue 
Total revenue R = Price x quantity sold 
R = Px 
Totali revenus 
Average revenue = —@ 2a = P 
quantziy said 
: aR 
Marginal Revenue = Es 


= — (px)—P-rx <= (Here the product rule of Calculate 


is applied) 
Elasticity Function 
The elasticity of the function y=f(x) at the point x is given 


by 
a dy 


y cix 
Example 
If y=3x-6, find the elasticity of y 


Solution 


Elasticity — n =- 


av 
Ay =3x—6 “= =3(1)-0=3 


18 





3ex—-Z) | x—2 


Elasticity Demand 


If x—f(p) is the demand function, where x is the demand, 


and p is the price then the elasticity of supply 1s 
pax 


Ts x dip 


Example 


Find the elasticity of supply for the supply function 





= 2p^ +5 
— : p dx 
Elasticity of supply is given by n, — em 
As x — 2p? +5, = 2(2p) +0 = 4p 
P 4p? : 
"sc z- 4p aere ee 2p^ +5 
Example 2 


If the demand law is x=20/p+1, find the elasticity of 


demand at the point where p=3. 





Solution 
=20/p+1 

dx © +1020) -2 o5 41^ 

dp Cp 11 

__ £pb-1)(0—20(140) _ 20 

(pri) (p-r1iy* 
Ee pp dx 

Elasticity of demand t, ~ -- 5 





= (v/ AA (p4+137 —9 i) —- (Considering 
magnitude) 


Elasticity of demand when p=3 1s 


3 
A1 





3 
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Example 3 
The total cost C of making x units of product is 


C=0.00003x2-0.045x2+8x+25000. Find the marginal cost at 1000 


units output. 


Solution 
C=0.0000x?-0.45x7+8x+25000 


Marginal cost = = = (0.00003) 3x? — (0.45)2x + 8.1 


= 0.00009x*-0.09x+8 
Marginal cost at x = 1000 is 0.00009 (1000) — 
0.09(1000)+8=8 


Example 4 
For the demand function p=550-3x-6x* where x is the 


quantity demanded and p is the price per unit, find the average 
revenue and marginal revenue. 
The revenue function is given by 
R — Px 
= (550-3x-6x^) x 
= 550x-3x*-6x> 
Average revenue is given by R = = = 550 — 3x — 6x? 
Marginal revenue - = = (550x — 3x* — 6x?) 
— 550-6x-18x^ 
OPTIMIZATION CONCEPT 
MAXIMA AND MINIMA 
Everybody wants to maximize his gain and minimize his 
loss. Any decision to be taken by an organization, when 
quantified, reduced to a cost function or a profit function. We 


went to minimize the cost function and maximize the profit 
function. 
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When the cost / profit function is linear, we can draw the 
graph of the function easily and find the maximum / minimum 


value from the graph itself. 

When the function is a quadratic function, we can use 
calculus to find the maximum or minimum. Fig 1.7 gives two 
functions having a minimum and maximum at x=xọ respectively. 
In both cases, the tanget at (x0,f(x9)) is parallel to the x-axis. As 
(=) x = Xg is the slope of the tanget x=xo, (m) x — xg has to 


zero,. 


MX 7D 


Xo Fig 1 


To distinguish between attainment of maximum and 





$ed d? y d? y : "— 
minimum, we evaluate (—7) x = xo. I —)x — xg is positive, 


a Y 





the function y=f(x} has minimum at x-—xo. If x = Xp is 


negative, the function y=f(x) has maximum at x=xọo. (If 


- 2 
2 =) x = xg = 0, then we have conditions for maximum in 





terms of higher derivative. This is beyond the scope of our study. 
Procedure to find Maxima and Minima 
Let y=f(x) be a function, whose maxima and minima are to 


be obtained. 


i) Find t) and equate it to zero 
x 
. * ay a La] 
ii) Solve the equation 4- = 0 obtain its roots, say X1,X2,..-. 
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. If it is negative they y=f(x) 





iv) Find the value o 
attains a maximum at x=x;. If it is positive then y=f(x) 


attains a minimum. 


Perform step iv) for each root to get all the maxima and 


minima 
Example 1 
Find the points or maxima and minima of the function 
y=X 3 3x^-5 
Solution 
y—x^-3x^5 
- £X — 3x? — 6x 
dx 
= 3x(x-2) 
<> — 0 when 3x(x-2)-0 


That is, x=O or x—2 


: ae dy ; 
Differentiating ~~ again w.r.t x, we get 





2. 
= = = 6x — 6 
dax 
ES l 
When x=0, 7.3 = —6 « O (negative) 


and when x=2, < = 6 > D (positive) 


The function attains its maximum at x=0 and minimum at 
x=2. At x=0, y=0-0+5=5 is the maximum value and at x=2, y=3- 
12+5=1 is the minimum value. 

Example 2 : 

Investigate the maxima and minima of the function y- 

2x°+3x*-36x+16 
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Solution 


y=2x°+3x’-36x+16 


Uy» _ =, 2 
^ n Ox + 6x — 36 


dy 
Equate Pus to zero 


G) == 06x? + 63x — 36 = 0 

When divided by 6, we get x*+x-6=0 

While solving the equation x*+x-6=0, wet get (x+3) (x- 
2)=0 

Hence x=-3 (or) x=2 


: M ay : 
Differentiating ae Once again w.r.t.x, 


cu?» 
a — 12x +6 
d? 
At x=-3, = = —36 + 6 = —30 <0 


At x=-3, the function y has maximum value. To get the 
maximum value substitute x=-3 in y. 
y=2(-27)+3(0)-36(-3)+10 
= -54+27+108+10 
y —91 
91 is a maximum value 
At x=2, 5 = 24 + 6 = 30> 0 
The function y has a minimum value at x=2. Substitute 
x=2 1n y. 
y=2(8)+34-72+10 
= 16+12-72+10 
= -34 


The minimum value is -34. 
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Example 3 | 
Find the maxima and minima value of the cost function 


C-—542x^-x^ 

SOLUTION 
C—522x^-x^ 
zt = 4x — 8x? 
cix 


ac 
— to zero 
Equate 4. 


: ac __ o 29 
Ge) = = 0 — 4x — 3x 
=> xid4— 3x) = Q0 
=> x = 0 Cor)x = 4/3 
ac 
Differentiating ,. once again W.r.t.x, 


d*c 
— = 4— 6x 
cix 


= *- - 
when x=0, M — 4 — 6(0) = 4 > 0. Hence C has a minimum at 


x=0. 
The minimum cost = 5+2 (07-07% =$ 
4 d?c 4 
When x = Jizz = 4 e (= 
= 4-8=-4<0. Hence C has a maximum 
4 42 442 
When x =2;¢=5+2(5) —(G) = 167/27 
Example 4 


A cricket association in a city is arranging an inter-state 
cricket match. The association estimates that 3000 spectators 
would attend the match if the ticket costs Rs. 80. It feels for ever 
decrease of Rs. 10 in the ticket price, 600 additional persons will 


buy tickets. Find the optimal ticket price so that the association 


will get maximum revenue. 
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SOLUTION 

Let p denote the price / ticket and n be the number of 
spectators. We can assume that the number of additional 
spectators is proportional to the decrease in the price of a ticket. 
So far decrease of one rupee in ticket price, the increase in 
number of spectators in 60. 

n = 3000 if p=80 

n = 3000 + 60x if p=80-x 

Total revenue R=n.p 

= (3000--60x) (80-x) 


= 240000--1800x-60x^ 


daR 


— = 18300 — 120x 
cx 


rr S = 0,1800 —120x = Q jie x = 15 


d?m 
(5) = —120 — 0 


Hence R attains maximum when x=15. 

So the optimal ticket price is 80-15—65. 

Maximum Revenue = 240000+1800(15)-60(15)” 

— 253500 rupees 

When the profit / cost is linear or piecewise linear (that 1s, 
it is given by two different linear functions for different range of 
values), we cannot apply calculus to get maxima or minima. In 
this case, we can represent the function graphically and get the 


point at which maximum / minimum 1s attained. 


SUMMARY 
In this unit we came to know that the first and higher derivatives 
of a function, maxima and minima of a function and application 


of derivatives. 
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KEYWORDS 
Fixed cost 
Variable cost 
Average cost 


Marginal cost 


KEY TO CHECK YOUR PROGRESS 


1. Find the second derivative of 2x? 3x” 36x 10. 


Ans:12x 6. 
2. Find a if y= 5x? 6x? 3. 


Ans:15x* 12x. 


Exercise: 
1. Find the maximum value of the function 


x^ 2x? 3x? Ax 4. 


2. The total variable cost of monthly output x tons by firm 


producing a variable metal is Rs. Lc 3x^ 5x and the 


fixed cost is Rs. 300per month. Find the output of 


minimum average cost. 
3. The unit demand function is x25 2p), where x is the 


number of units and p is the price. Let the average cost per 
unit be Rs. 40. Find 

i. The revenue function of R in terms of p, 

ii. The cost function C, 

111. The profit function P, 

iv. The price per unit that maximizes the profit function, 


v. The maximum profit. 
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UNIT 2 CENTRAL TENDENCY, DISPERSION AND 
CORRELATION 
INTRODUCTION 


One of the most important objectives of statistical analysis 
is to get one single value that describes the characteristic of the 
entire mass of unwieldy data, such a value is called the central 
value or an average or the expected value of the variable. The 
word average is very commonly used in day-to-day conversation. 
For example, we often talk of average boy in a class, average 
height or life of an Indian, average Income etc., when we say , 
‘he is an average student’, what it means is that he is neither very 
good nor very bad, just a mediocre type of student. However in 
statistics the term average has different meaning “Average is an 
attempt to fine one single figure to describe whole of figures" 
Unit Objectives 

On learning this unit, you could able to 

Study about Data analysis. 

Discuss about measurers of Central Tendencies. 

Discuss about measures of Dispersion 

To find the relation between two variables. 

To find the correlation between the values in terms of their 

ranks. 

To calculate the unknown variable with the help of a 


known variable. 
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Unit Structure: 


2.1 Data Analysis. 

212 Measures of Central Tendency. 
2.3 Measures of Dispersion. 

2.4 Correlation Analysis. 

2.5 Regression Analysis. 
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Statistical data constitute the raw material for statistical 
methods. The statistical data may be classified into two types. 
They are 

(1) Primary data and 

(11) Secondary data 
Primary Data 

The processed data which 1s given to the user is called 
information. If the data 1s collected for the first time then it is 
called Primary data. 

Example: 
1. Attitude survey on a particular brand of product available 


in the market. 


2. The reasons for absenteeism among employees in an 
organization. 
3. Opinion poll about the election results. 


The above examples represent primary data and are 


generated for the first time buy the investigator. 
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Secondary Data 


If data, which is already collected and used for some 
purpose is once again used for different study, then it is called 
secondary data. As far as secondary data is concerned the 


availability and reliability of existing data should be given more 


importance. 


Methods of Collecting Primary Data 


Observation Techniques Interview Techniques 
Questionnaires Survey Techniques 

Primary data is collected by Observation Techniques or 
Interview Techniques or Questionnaires Survey Techniques. 
Observation Method 

In this method the researcher has collected the information 
himself. This is more intensive rather than extensive. It gives 
more reliable information, because the researcher has cross 


examined particular situation in several ways. 


Merits: 
1. It is in depth study. 
2. More accurate date can be obtained. 
Demerits: 
I It is not uniform. It differs from person to person. 
2. It demands very high interpersonal skills of the 
researcher. 
3. It involves more cost and time. 
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Interview Method 


The interviewer and Interviewee are directly meeting each 
other. The interviewer asks questions and the responses from the 
interviewee were recorded. 


Telephonic Interview: 
In this method responses are collected through telephonic 


conversation. 
Merits: 
1. The information is unbiased. 
2 Adequate information is collected from the respondent. 
3; High degree of accuracy can be aimed. 
4 Responses will be encouraging because of personal 
approach. 
Demerits: 
1. Costlier and time consuming. 
2. In order to get real position sufficient numbers of 


persons are to be interviewed. 
3. It needs high skills for the interviewer to elicit proper 
responses. 


Questionnaire Survey 


A printed questionnaire which contains the questions for 
eliciting the required information is designed. The 
questionnaires are supplied to the respondents and the 


respondents are asked to fill up. 
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Merits: 


Demerits: 
1. 
2. 


It is the cheapest and less time consuming. 
It can be used extensively and the area coverage is very 


large. 


Error involved in the survey is less due to extensive 


area coverage. 


This is mainly used for literate people. 

Inelasticity is more, and so acquiring further 
information is not possible. 

We cam not get 100% response, since the rejection of 


questionnaire will be inevitabile 


Secondary Data 


There are two types of sources of secondary data 


(1) Published sources 


(11) Unpublished sources. 


Published Sources: 


Various governmental, international and local agencies 


publish statistical data and important among them are 


(a) 
(b) 
(c) 
(d) 


(e) 
(1) 


(g) 


International publications. 

Official publications of central and state governments. 
Semi Official publications. 

Publications of research institution such as Indian 
Statistical Institute, Indian Council of agricultural 
rescarch etc, 

Publication of Commercial and financial institutions. 
Reports of various committees and commissions 
appointed by the government. 


Journals and news papers. 
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Unpublished sources: 
There are various sources of unpublished data. They are 


records maintained by the various government and private 
offices, the researches carried out by individual research scholars 


in the universities or research institutes. 
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The measure of central tendency is called as measures of 


location. This measure of Central Tendency is called as 


“Average”. 
Average is defined as a single value with in the range of 


data that is used to represent all values in the series. If we 
consider the scope of average, the average is represented by the 


following measures. 


1. Arithmetic mean. 
2. Median 

3. Mode 

4. Geometric Mean 
5. Harmonic Mean 


Arithmetic Mean 


() The arithmetic mean x of n observations X;, Xo, .... Xn 1S 
given by 
x = Xb Xo... Xn 
n 


(ii) The formula for calculating average for data with 


frequencies is mathematically expressed as arithmetic mean. 
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x = dul; Xi Where N= 2f; 


N 
Where X - Arithmetic mean 
f; - frequency of i" item 
X; - The individual value of i item. 


Gii) In the case of grouped or continuous frequency 
distribution, the arithmetic mean x is given by 
n 
X =A+ difix; xc 
t=] _ 
N 
Where di = Xi-A 
C 
C — Length of the interval 
A- Assumed mean., 


Illustration 
The age of 10 retired pensioners are given as below 
74, 62, 84, 72, 61, 83, 72, 81, 64, 71. 


Calculate the average age of pensions who are getting 


pension. 
Solution: Average age = n 


> 6 
i=] 
n 
= 74+ 62 + 84 +72 + 61+ 83 + 72 + 


81+ 64+ 71. 
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Illustration: 
Calculate the monthly average pension payable per month. 
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Solution 


Monthly 
pension 
X 
20 


25 


No. of person receiving 
the pension 





Average pension x = 2 fx 
N 
= 705 
25 
= 28.20 
Illustration: 


Calculate the Arithmetic mean of the marks from the 


following table 
10- | 20- 30- | 40- 50- 
memes [e | is [r [20 [7 | 
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Solution 


No. of 
students 





Arithmetic mean —A--?fd Xc 
N 


=25+30 X10 
100 
= 25 + 300 
100 
=25+3 
= 28 


Note: If the grouped frequency distribution is not continuous, we 
first convert it into continuous distribution as in the 


following example, 


(Subtract 0.5 from lower limit and add 0.5 to upper limit) 


Calculate the Arithmetic mean of the marks from the following 
table 









20-29 30-39 40-49 50-59 





= = 
Nooten — | » 
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Solution 


No. of 
students 
F 


9.5-19.5 
19.5-29.5 


29.5-39.5 
39.5-49.5 
49.5-59.5 





Arithmetic mean =A+)> fd XC 
N 


=34.5+95 X10 
114 

= 34.5 + 950 
114 

= 34.5 + 8.3 

= 42.83 


Median 

(i) The median is the middle observation in data that have 
been arranged in ascending numerical sequence. 

(ii) In the case of frequency distribution Xi/fj,, median is 
obtained by considering the cumulative frequencies. The 
steps for calculating median are 

(a) Find N where N = 5f = total number of 
frequencies. 
(b) See the Cumulative frequency just greater than N/2. 


(c) The corresponding value of x is the Median. 
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Gii) In the case of continuous frequency distribution, the class 
corresponding to the cumulative frequency (C.F) just 
greater than N/2 is called the median class and the value of 


median is obtained by the formula 


Median =1+ h [Ncc j 
f 2 
Where 1 — Lower limit of the median class. 
f— Frequency of the median class. 
h — Magnitude of the median class. 
C — Cumulative frequency of the class 


preceding the median class. 
Iltustration 
In a telephone exchange the data on the inter arrival time 
in seconds was collected and the observations are given below: 


18, 18, 20, 22, 10, 8, 7. Find the median value. 


solution: 
Step 1 : Arrange the items in ascending order 
7, 8, 10, 15, 18, 20, 22 
Steps 2 : Median = [ n+ 1 ) th item 
2 


= [2n] item 


E } th item 


th item 
15 
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Illustration: 
In a telephone exchange, the data on the inter arrival time 


in seconds is given below: 
15, 18, 20, 25, 22, 10, 8, 7 Find the median value. 


Solution: 
Step I Arrange the items in ascending order 


7, 8, 10, 15, 18, 20, 22, 25 


Steps 2 : Median = [ n+l } ® item 


= 4.5" item 


= 4" item + 5” item 


Ulustration: 


Obtain the median for the following frequency distribution 
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Solution: 


The c.f just greater than 60 is 65 


Median class 
The value of x corresponding to 65 is 5 


Hence median = 5 





Illustration: 


Obtain the median wage of the following distribution 


20-30 30-40 40-50 50-60 60-70 






solution: 
Here N —43-—21.5 
2 2 
The c.f just greater than 21.5 is 28 


edian class 


Median 
=]+ h E mex 
f | 2 
Where! — Lower limit of the median class=40. 
f— frequency of the median class=10 
h- Magnitude of the median class. \= 20 
C — is the cumulative frequency of 





the class preceding the median class= 8 
39 


& Median = 40+ 10; 21.5-10 
20 


= 40+ 6.75 = 46.75 


Mode 
(1) Mode is the value which occurs most frequently in a set of 


observations and around which the other items of the set cluster 
closely. 

(ii) In the case of discrete frequency distribution mode is the 
value of x corresponding to the maximum frequency 


(iii) In the case of continuous frequency distribution mode is 


give by the formula 


Mode = 1+ h (fi- £5) 


2 fi-fo - È 
Wherel — is the lower limit of the Modal class. 
h— is the magnitude of the Modal class. 
fi— 1s the frequency of the Modal class. 
fo— is the frequency of the class preceding 
Modal class 
f,— 1s the frequency of the class succeeding the 
Modal class 
Modal class: - The class corresponding to the maximum 


frequency is called modal class 


Illustration 


Compute the mode for the following: 
1,9, 4, 2, 12, 4, 14, 15, 5, 14, 6, 14, 12 
Solution: By data 
1, 9, 4, 2, 12, 4, 14, 15, 5, 14, 6, 14, 12 
Mode = 14 
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Illustration: 


Find the mode for the following distribution: 





Maximum frequency 
Here maximum frequency is 25. 
The value of x corresponding to the maximum 
frequency 25 is 4 
Hence Mode = 4 


Illustration: 
Find the mode for the following distribution 


0-10 | 10-20 | 20-30 | 30-40 | 40-50 | 50-60 | 60-70 | 70-80 
No of 
mem | 3 | 8 | 7 | 12 | 28 i 


Solution: 


X f 


















Model Class = 28 
(Maximum frequency) 





h (fi - f, 
Mode= 1 + 2f-f, - 


Where 1— 40 (lower limit of the Modal class) 
h— 10 (length of the class interval) 
fi— 28 (frequency of the Modal class) 
fo— 12 (frequency of the class preceeding 

the Modal class) 
f,— 20 (frequency of the class succeeding 
the Modal clas h (f; - fo) 
Mode = 1 + al 


-40+ 10 (28-12) 


2x28 -12 -20 
E 10 (16) 
=40+ 56-32 
= 40 + 160 

24 
= 46.667 


Empirical relationship between Mean (M), Median (Ma) and 
Mode (M,) 


In case of symmetrical distribution Mean, Median and 
Mode coinside (ie M = M, = M.. 
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Suppose the distribution is asymmetrical, Mean, 
Median and Mode usually will co not inside but they obey 
the following important empirical relationship, given by 
Karl-Pearson. 

Mode = 3 Median — 2 Mean 


Illustration: 


Compute Mean, Median and Mode for the following 
distribution. 


10-15 | 15-20 | 20-25 | 25-30 | 30-35 | 35-40 | 40-45 | 45-50 


Solution 
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Arithmetic mean = A+ > fd jt 


= 32.5 -255 X» 
1000 
= 32.5 -1275 
1000 
= 32.5 -1.275 
= 31.225 


Illustration: 
Find the missing frequency from the following data 


wem |s |o a| ie 


The arithmetic mean is 34 marks 
Solution 











The missing frequency be denoted by m 





= 2200+35m 
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Arithmetic mean x 


; 





N 
= 34+ 2200+ 35 m 
70 + m 
. 34 (70 + m) = 2200 + 35 m - 
2380 +34m, = 2200 + 35 m 
35m — 34m = 2380 — 2200 
m = 180 
Hence the missing value is 180 
Median 

N = 1000 = 500 
2 2 


Median =l1+ h N C 
f |2 
Wherel — 
f— 
3 
h — 


Median = 30+ _5_ ( 500 - 425 | 
303 


— 31.238 
Mode 


30 


30 


The class corresponding to the maximum frequency 


303 is 30-35 which is the ‘Modal class’. 


2 fi- f - f 
Where ] — 3 (lower limit of the Modal class) 
h — 5 (length of the class interval) 
fi— 303 (frequency of the Modal class) 
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£.— 270 (frequency of the class preceding 


the Modal class) 


f— 197 (frequency of the class succeeding 


the Modal clas B (fi- fo) 
oe. t 
Mode E q 


39 + 2 (303-270) 


606-207-197 


5 (3) 
— 45 * 139 


— 31.187 


Combined Arithmetic Mean 
If we have the arithmetic mean and numbers of items 


of two or more than two related groups, we can compute 


combined average of these groups by applying the following 








formula 
X. n, X, n Xa 
"m My 
Where 
| ao - Combined mean of two groups 
X,  - Arithmetic mean of first group 
X,  - Arithmetic mean of Second group 
ny - Number of items in the first group 
n> - Number of items in the second group 
Hustration 


The mean height of 25 male workers in a factory is 61 


inches and the mean height of 35 female workers in the same 
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factory is 58 inches. Find the combined mean height of 60 
workers in the factory. 
Solution 


Given, 
nı, = 25 n= 35 xı = 61 X2 = 58 





nX, n,X. 


Combined mean X, 
"n Me 


— 25 x 61+ 35x 58 
25+ 35 


= 1525 +2030 
= 3555 = 59.25 


Thus the Combined mean height of 60 workers is 
59.25 inches. 
Illustration 

in a certain examination the average grade of all 
students in class A is 68.4 and students in class B is 71.2. If 
the average of both classes combined are 70. Find the ratio 
of the number of students in class A to the number of student 
in B. 
Solution :- 

Let us assume that the number of students in class A 
was X and in class B was Y. 

We are given, x12 = 70 xı = 68.4 X2 

= 71.2 

mX, MX, 


Combined mean X, 
"n "m 
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70— n,(68.4) +m (71.2) 


nı + no 
70— x(68.4) + y (71.2) 
x+y 
70 (x+y) = 68.4x + 71.2y 


7-Ox + 70y - 68.4x - 71.2y = 0 
1.6x — 1.2y = 0 
1.6x = 1.2y 


Suppose X = 10 
1.6 x 10 = 1.2y 


1.2y = 16 

y= 16 
1. 

y= 40 
3 


Thus x and y are in the ratio of 10 : 40 or 30 : 40 


3 
Hence for every 3 students in class A, there are 4 students 1n 
class B. 


. Correcting incorrect values: 


If sometimes happens that due to an oversight or 
mistake in copying certain wrong items are taken while 
calculating mean. The problem is how to find out the correct 
mean. The process is very simple. From incorrect Ex 
deduct wrong items and add correct Ex by the number of 
observations. The result, so obtained, will give the value of 
correct mean. 
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Illustration 


The mean marks of 100 students were found to be 40. 
Later on it was discovered that a score of 53 was misread as 


83. Find the correct mean corresponding to the correct score 


Solution :- 
We are given n — 100, xı = 40 
Since x, = 2X 
n 
>x=nx, =100x 40 = 4000 


But this is not correct 2 x 


Corrected >x = Incorrect 2x — Wrong item + 
correct item 


4000 — 83 + 53 


= 3970 
Correct x = Correct > x 
n 
= 3970 
100 
= 39.7 


Hence the correct average = 39.7 


Geometric Mean 


Geometric mean is defined as the n^ root of the 


product of n items. 


Geometric mean = MAI X x X .... Xn 
Where n — number of items 
X1 , X2, X3 .....  -— are varies values. 
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Calculation of geometric mean — individual series: 


Steps: 
1. Find out the logarithm of each value or the size of 


item from the log table — log x 


2. Add all the values of log x - 2ulog x. 

3. The sum of log (ie 3log x) is divided by the number of 

items ie log x 

4. Find out the antilog of the quotient (from step 3). This 
is the geometric mean of the data 

Illustration : 


Compute geometric mean of the following 50, 72, 54, 82, 93 


Solution : 





Geometric mean = antilog in x 


n 


= antilog [ 2.1710 rm f 


= antilog 1.8342 
= 68.26 
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Calculation of Geometric mean — Discrete series: 


Steps: 
l. Find out the logarithm of each value - - log x 


2: Multiply the log of each size by its frequency — f log x 
3. Add all the products thus we get - flog x. 


4. Divide the total product by the total frequency N (ie _ 
f log x) 
n 


5. The antilog of the step & is the result G.e) GM = 
antilog flog x) 
N 
Ilfustration 
The following table gives the weight of 31 persons in 

a sample survey calculate Geometric mean. 

e eres [oe e rnm [m [o n 
No. of 

person 


Solution 
Calculation of Geometric Mean 


size of items frequency 
meme Temm T va. [ ren 
130 ; 














3 6.3417 
135 4 8.5212 
140 6 12.8766 
145 6 12.9684 
146 3 6.4932 
148 5 10.8515 
149 2 4.3464 
150 1 2.1761 





Geometric mean = antilog f N 


= antitog [ —31 ] 


= antilog 2.1539 
= 142.5 lbs 


calculation of Geometric mean 
steps :- 
1) Find the mid value of each - M 


2) Find the logarithm of the mid value — log m- 
3) Multiply the log of M by their respective frequency f | 
log m 
4) Add up all the products flog m 
5) Divide flogmbyN(ie) flogm 
N 


6) Find out the antilog of the result of step 5 and this will 


give the answer: GM = antilog flog m) 
N 


Find out Geometric mean for the following data:- 


11-13 | 14-16 | 17-19 | 20-22 | 23-25 | 26-28 
Ecce 
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Yield of m 
wheat 8-10 
mounds 
ZME 
person 









| Solution 
Calculation of Geometric Mean 


ee i LM ME I Im E 


7.5-10.5 | 4.7710 
10.5-13.5 ! 9.7128 
13.5-16.5 | | 22.3459 
16.5-19.5 | ! 28.8719 


19.5-22.5 7 i; 9.2554 
22.5-25.5 5.5208 
25.5-28.5 1.4314 


f log m = 
81.9092 





Geometric mean = antilog [ a | 


= antilog í T ) 


= antilog 1.045 
= 16.02 mounds 





Harmonic mean:- 

Harmonic mean is the reciprocal of the arithmetic 
average of the reciprocal of values of various items in the 
variable. 


Calculation of Harmonic mean — Individual series 
Steps:- 
1) Find out the reciprocal of each size (ie \) 1/x 


2) Add all the reciprocals of all values ( ) 
3) Apply the formula : Harmonic Mean 
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Illustration: ME | 
The monthly incomes of 10 families in rupees in a 


certain village are given below:- 





-— 
| 
U3 
N 
© 
O 
ho 
~J 
~] 
oo 


Calculation of H.M. — discrete series 


Steps:- 
1) Find out the reciprocal of each size of item 1/x 


2) Multiply the reciprocal 1/x of each size by its 


frequency f (1/x) 
3) Add up all the products f(1/x) 
4) Apply the formula H,M = N 





f (1/x) 
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Illustration : | 
Calculate H.M. from the following data: 





H.M - E ] 
f (1/x) 


34 
4.1319 


8.23 


Calculation of H.M. — continuous series 


Steps:- 
1) Find out the mid value of each class m 
2) Find out the reciprocal of each mid value 1/m 


3) Multiply the reciprocal of each Midvale by its 
frequency f(1/m) 
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4) . Add up all the products 
5) Apply the formula H,M 


Illustration : 


l 





/m) 
f 


N 
(1/m) 


Calculate H.M. from the following data : 


Solution 






6 [ts 


0.02857 
0.02222 
0.01818 
0.01534 


0.01333 
0.01176 


30-40 | 40-50 | 50-60 | 60-70 | 70-80 | 80-90 | 90-100 


Frequency | 15 | 13 | 8 — LT IS. 


f (1/m) 
0.42855 
0.28886 
0.14544 
0.09204 
0.19995 
0.08232 


0.06318 
f (1/m) = 
1.30034 


01.01053 








= 70 
1.30034 


z 53.83 
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In statistics, statistical dispersion (also called statistical 
variability or variation) is variability ore spread in a variable 
or a probability distribution. Common examples of measures 
of statistical dispersion are the variance, standard deviation 
and inter-quartile range. 

Dispersion is the measure of the variation of the 1tems:- 


Methods of measuring dispersion 


The following are the important methods of studying 


variation. 
(1) Range 
(ii) Quartile deviation i 
(111) Mean deviation 


(iv) Standard deviation 


RANGE: 

Range is the simplest dispersion It is defined as 
the difference between the value of the largest item and 
the value of the smallest item included in the distribution. 

Range =L- S 
Where L = Largest value 
S = Smallest value 


The relative measure corresponding to range is called the 


coefficient of range. 


Coefficient of range = L -S 
LES 
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Illustration 


Find the range of weights of 7 students from the 


following 
27, 31, 35, 36, 40, 42, 43 
Solution:- 
L = Largest value = 43 
S = Smallest value=27 
Range —L-—S 
= 43 — 27 = 16 


Coefficient of range = L — S = 43-27 
L+S 43+27 


= 16/70 = 0.23 
QUARTILE DEVIATION:- 


Quartile deviation is an absolute measure 
dispersion. 


Quartile Deviation = ỌQ3:-Ħ;ı 
2 


The relative measure of dispersion, known 


coefficient of quartile deviation, is calculated as follows : 


Coefficient of quartile deviation = Q3- Qi 
Q3 + Qi 
Where, Qi, Lower Quartile 
Q= Upper Quartile 
Formula for finding lower quartile Q, 


Qı =1+h (3-5 
f 4 


where, 1 — lower limit of the Q, class 


f — corresponding frequency of the Q; class 
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of 


as 


h — magnitude of the Q; class 


e — 1s the cumulative frequency of the class preceding 
the Q; class. 
Formula for finding upper quartile Q4 


where, 1 — lower limit of the Q; class 


f — Corresponding frequency of the Q; class 
h — Magnitude of the Q; class 


c - Cumulative frequency of the class preceding the Q3 
class. | 


Illustration :- 


Calculate the range and quartile deviation of wages 


ae 30-32 | 32-34 | 34-36 | 36-38 | 38-40 | 40-42 | 42-44 


toes | 2 |e |i | we | 2] 8 


Also calculate the quartile coefficient of dispersion 


Solution : Range = L — S = 44 — 30 = 14 Rupees 











Qı Class 


(5 Class 


86 = 21.5 
4 


(in c.f, just greater than 
21.5 is Q; 30 class) 





Where 1 = 32, f= 18, h= 2, c=12 
—32-- 2 (21.5— 12) 
18 


=32+ 2 (9.5) 
18 


— 32+ 19 

18 
= 32+ 1.06 
= 33.06 


3N — 3 x 86 — 64.5 
4 


(in c.f, just greater than 64.5 is 72 Q3 class) 


Q3 -l-h (az-e | 
f | 4 


I 


Where 1= 38, f= 12, = 2, 
60 
= 38+ 2 (64.5 — 60) 
12 
= 38+ 2 (4.5) 
18 
= 38 + 0.75 
= 38.75 
Quartile Deviation = Q3- Qi 
2 
— 38.75 — 33.06 
2 
= 5.69 
2 
= 2.85 
Coefficient of Quartile Deviation = Q5 - Qi 
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= 38.75 — 33.06 
38.75 + 33.06 
= 5.69 
71.81 
= 0.08 
Illustration :- 


Calculate the quartile deviation and coefficient of 


quartile deviation for the following:- 
enr — imim oism|e|mwim 


Number of members 132 153 140 


Solution: 










Qı Class 











th 
Q) = Value of | = l | item 
th 
= Value of Í 343 = i | tem 
th 
= Value of [ = | tem 
= 136"item 
= 40 years 


(& Just greater that 136 is 196 — Q; class) 
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(Qu = Value of 3 E Ntl } item 
th 

= Value of 3 & "s ] item 
th 

— Value of 3 [ - ) item 


Value of 3 x 136" item 
— Value of 408'* item 


= 60 years 
( Just greater that 408 is 489 — Q4 class) 
Quartile Deviation = Q3- Qi 
2 
= 60-40 
2 
= 20 
2 
= 10 years 
Coefficient of Quartile Deviation = Q5 - Qi 
Q3 + Qi 
= 60 - 40 
60 + 40 
x: 20 
100 
= 0.02 
3. MEAN DEVIATION OR AVERAGE 
DEVIATION 


Mean deviation is the arithmetic mean of the deviation 
of a series computed from any measure of central tendency, 
(i.e) mean, media or mode; all the deviation are taken as 
positive 

(1.e) + and — signs are ignored. 
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Coefficient of mean Deviation: 

Mean deviation calculated by any measure of central 
tendency is an absolute measure. 
For the purpose of comparing variation among different 
units, a relative mean deviation 1s required. 

The relative mean deviation or coefficient of mean 
deviation is obtained by dividing the mean deviation by the 
average 


Mean deviation about A=1  [|x— A| 
n 


Where A-mean or median or mode 


Coefficient of mean deviation = Mean deviation about 
A 


A 
In the case of frequency distribution 


Mean deviation about A=1  |x— A| 
n 


IHustration 
Calculate the mean deviation from mean for the 


following data 


Cmemena | xa | ss | se | swo 


Solution: 
| 2 
| 9 | 22 | 66 









P 


Mean = 
N 
= 52 
10 
= 5.2 
Mean deviation about A=1 |x—A| 
n Where 
A=Mean 
= 14.8 
5.2 
= 1.48 
Coefficient of mean deviation = Mean 
deviation about A 
Mean 
= 1.48 
= 0.28 
Illustration 


Find the mean deviation about median for the following 


519 | 10%- | 15- | 20- | 25- | 30- | 35- | 40- 
15 | 20 | 25 | 30 | 35 | 40 | 45 
cleisie|sle4lels 


Also compute the coefficient mean deviation about median 


distribution 

















No. of 
students 


of dispersion 
Solution:- 
We know that, 
Median  =1+ h (3-0 ] 
2 


Where 
1 — Lower limit of the median class 
f — Corresponding frequency of the median class 
h — Magnitude of the interval 


c - Cumulative frequency of the class preceding 


median class. 


Range = L— 5S = 44 — 30 = 14 Rupees 


f |x-A| 







D 
N 


^T 





fi | QO | Qo 
re 


— | n 
OO | W 
U | UA 
QNIN 


jee] | | l ES 
=336 
Here N= 49, N/2 = 24.5 


Thus the median class 1s 15-20 
1=15, f= 15, h= 5, c=11 


Median =]+ h [pt ] 


=15+_5 (245-11) 
15 


=15+ 5 (13.5) 15 
65 


15 
—154-4.5 
— 19.5 
Mean deviation about A—1  |x— A| 
n Where 
A=Mean 
= 336 
49 
= 6.85 
Coefficient of mean deviation = Mean deviation about 
Mean 
Mean 
= 6.85 
19.5 
= 0.351 


Standard deviation: 

A common measure of dispersion which is preferred 
in most circumstance in statistics is the standard deviation. 

It is denoted by O and defined as the square root of the 
average of square of deviation about mean 


The formula to find the standard deviation 1s 


| (x x 
FI 


In the case of frequency distribution standard deviation 


| f(x xf " 
Fl 


Coefficient of variation Standard deviation X 
100 


| 
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Mean 
= o X 100 
x 


Note : The formula of standard deviation can be simplified 


as, 
S.D 1 d 
n 
S.D 1 fd 
n 


Sometime we write the formula as 





© SD D e 
n n 
- Tu —— eG ea 
(ii) S.D 1 fd (4 M 
m N N 
C 


For comparing the variability of two sets of a frequency 
distribution we calculate the coefficient of variation for each 
of the distribution and that distribution having smaller 
coefficient of variation is considered as more consistent or 


more stable or more reliable then the other. 


Illustration: 
Calculate the mean and standard deviation from the 


following table giving the age distribution of 542 members. 


20.30 | 30-40 | 40-50 | 50-60 | 60-70 | 70-80 | 80-90 


No. of 


67 






Solution : 


Mean 


g 


to 
s 
UJ 
o 


00 
T 
t 
= 


| E 


542 542 


765 -225 


542 542 


A+ fd XC 
N 


55-c- 10 X(-15) 
542 


54.723 





, X 10 


765 --15 


X 10 


sos 


141.067 


11.877 
Illustration : 
The scores of two golfers X and Y in 12 rounds are 


given below. Who 1s the more consistent player.. 








Since coefficient of variance of the player Y is less. Y is the 


better player and more consistent player. 


Combined Standard Deviation:- 

If a sample of n; items has mean x, and standard 
deviation 0; and another sample has mean x2 and standard 
deviation O2, we can find out the combined mean and 
combined standard deviation by using the formula, 


Combined mean X,, mA, mX 


n n, 
Combined standard deviation = ni (Oi, di?) + 
m (92^ -- do”) 
I + N2 
Illustration:- 


The mean of 2 samples of sizes 50 and 100 
respectively are 54.1 and 50.3 and standard respectively are 
24.1 and 50.3 and standard deviations are 8 and 7. Find the 


mean and the standard deviation of the sample size of 150. 
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Solution:- 
Given that n= 50 n= 100 xj— 54.1 xo^ 503 
O= 8 O5— 7 


n, X, n, X, 
fos m 


Combined mean x 


50 x 54.1 + 100 x 50.3 


20 x 100 
= 51.57 
di = 54.1 — 51.57 = 2.53 
d» = 50.3 — 51.57 = -1.27 
Combined standard deviation = nı (Oj; 24+ dı) + 
n (O^ + d?) 
nı + n2 
= 50( ?+ (2.53)? +100 
SO x 100 
= 7.5637 


CORRELATION AND REGRESSION 
Definition: Correlation 
Correlation is an analysis of the relation between two or 


more variables. 
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Types of Correlation: 


Positive Correlation 
Two variables are said to be positively correlated if for an 


increase in the value of one variable there is a increase in the 
value of the other variable or for a decrease in one variable, there 
is a decrease in the other variable. That is, the two variables 
change in the same direction. 

For example, the prices of the product increase due to 
increase in quality. The salary of employees and the number of 
years of experience are positively correlated. 

Negative correlation 

Two variables are said to be negatively correlated if for an 
increase in the value of one variable there is a decrease in the 
value of the other variable. That is two variables change in 
opposite direction. 

For example: The number of children who receive Polio 
drops and the number of children affected by polio are negatively 
correlated. 

Also price and supply are negatively correlated. 

No correlation 

The variables are said to be uncorrelated if the change in 
the value of one variable has not effect with the change tn the 
value of the other variable. For example, In an organization the 
profit earned by the company and the age of the chairman of the 


company are not correlated. 
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Karl Pearson coefficient of Correlation 
Correlation coefficient between two random variables X 


and Y, usually denoted by (x,y) is defined as 


Cori, 
Ax, y) = Covix.y) 


C. Cyr 


Where cov(x,y) = = Exy —xy 


Gy = =- Ix? — x? 
Cy = Tay ire 
(OR) 


r (x ) = Ax —- XIV) 
n4 = X358. rty-y0 


The value ofr is lies between -1 and +1 
1. If r=+1 there is a perfect positive correlation between the 
two variables. 
2. If r—-1 there is a perfect negative correlation between the 
two variables 
3. If r=O implies there is no correlation between the two 
variables 
Illustration 1 


Calculate the correlation coefficient for the following 


heights (in inches) of fathers x and their son's y. 
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Solution 

















e. = _|— 5x? — (x)? — |? — (ea)? 
uu 37028 4624 
& 
— 2.121 
I E 328132 
Ox = Jia Q0? = j^ — (699? 
38132 
= ——— 4761 
= 2.345 
E 
rir. y) = 
Ox Fy 


=(375 60)—(68)(69- 
2.12127 2.345 


— 0.6030 
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Illustration 2 
Calculate the coefficient correlation between the two firms 


A and B from the following sales figure in thousands of rupees 


per week. 








REGRESSION 
Regression is a mathematical measure of the average 
relationship between two or more variables in terms of the 


original limits of the data. 


Lines of Regression 

If the variables in a bi-variate distributions are related we 
will find that the points in the scattered diagram will cluster 
around some curve called the curve of regression. If the curve is a 
straight line, it is called the line of regression and there is said to 
be linear regression between the variables. Otherwise regression 


is said to be curvilinear. 


(h) Line of regression of y on x is given by 
(x — x) = bxy(y — y) 
Ax—Xx»y—X) 
Where by bxy = p 
— Regression coefficient of y on x 
OY 
irs 


Where „= standard deviation of x 
y ^ standard deviation of y 
r — correlation of coefficient between x and y 
(11) Line of regression of x on y 1s given by 
(x — X) = bxy — y) 
Where bxy = aoo ooo 


Xy»)? 
— Regression coefficient of x on y 
mios 
ey, 


Where o,.=standard deviation of x 


c, —standard deviation of y 


r — correlation coefficient between x and y 


Note: Both the lines of regression pass through (x, #7) 
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Relation between correlation coefficient and regression 


coefficient 


A . Gc, 
We know that regression coefficient of y and x byx =r = 


and regression coefficient of x and y 
e. 
bxy = r~ 
Oy 


S. 
Ox Oy 


= byx x bxy =r 
= r% = byx x bxy 
r = LJjbyx x bxy 
Correlation coefficient 
r= +,/byx.bxy 
Illustration 
From the following data find, 
0) The two regression equations 
Gi) The coefficient of correlation between the marks in 
Economics and statistics 
(ii) The most likely marks in statistics when marks in 


Economic are 30 
Economics 
Statistics 


Solution 
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= -0.6643 


Regression coefficient of x on y 


Hx-X)w—y2 — -93 
bxy = "UP T —0.2337 


(i) Equation of the line of regression of x on y is 
(x — x) = bxy(y — y) 
x-32 = -0.2337 (y-38) 
= -0.2337y+0.2337 x 38 
(1e) x=-0.2337y + 8.8806 + 32 
© x--0.2231-40.8806 
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Equation of the line of regression of y on x is 
Cy — 320 = byx(x — x) 
Y -38—-0.6643 (x-32) 
=-0.6643x + 0.6643 x 32 
Ge) y = -0.6643x + 21.257638 
y = -0.6643x + 59.2576 
Gi) Coefficient of correlation 
r = 4+,/byx x bxy 
= + (—0.6643)-(—0.2337) 
= +0.1552 
= +0.394 
(iii) Now we have to find the most likely marks in statistic (y) 
when marks in economics (x) are 30. 
We use the line of regression of y on x 
(ie) y=-0.6643x + 59.2576 
Put x = 30 we get 
y = (-0.6443) x 30 + 59.2596 
= -19.329 + 59.2596 
= 39.9306 
~ 39 


Illustration 


In a correlation study the following values are obtained 





Coefficient of correlation 0.8 
Find the two regression equations that are associated with the 


above values. 
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Solution : 


By data, 

X = 65 y of 

Gy, = 2.5 o, = 3.5 
= 0.8 


(i) Line of regression of x on y 

(x —x)— ry — 4) 
vy 
2.5 

x — 65 — 0.8% —- (y — 67) 
x- 65 = 0.5714 (y-67) 
x=0.5714y — 0.5714 x 64 + 65 
x=0.5714y + 26.72 

(11) Line of regression of y on x 
(y -3)-9r£(x-x) 

3.5 

y—67 = 0.8% —-(y — 65) 
x-67 = 1.12 (x-65) 
y-67 — 1.12x — 72.8 
y—1.12x — 72.8+67 
y—1.12x — 5.8 


Illustration 
In a partially destroyed laboratory record of an analysis of 


correlation data, the following results only are legible. 
Variance of x=9 | 
Regression equation: 8 x- 10y4-66—0 
40x-18y = 214 
Find on the basis of the above information 
(i) The mean values of x and y 
(11) Coefficient of correlation between x and y 


(üi) Standard deviation of y 
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G) The mean values of x and y 


2x-10y=-66 (1) 
40x-18t=214 (2) 
(1)x 5 40x-50y—-30 
(2) 40x-183y — 214 
Subtracting -32y = -554 
y = —— — 17 


y-—1170rj4 — 17 
Substituting the value of y in equation (1) 
8x-10y—-66 
8x-10x17 = -66 
8x=-66+ 170 


8x—104 

x — 7 18 

x—13 or x = 13 
Mean value of x—x = 13; Mean value of y—y-—17, 
Gi) For finding out the correlation coefficient, we will have to 
findout the regression coefficient. 


Let us take equation (1) as the regression equation of x and 


8x-10y—-66 
7x=+10y-66 


bxy=y coefficient = = = 1.25 
And take equation (2) as the regression equation of y on x. 
40x-18y — 214 
-18y = 214-40x 
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—402x+ 214 





—ig 
40 —214 
Y = 18 42 


40 
byx = x coefficient = Ta 7 222 


~ r = djbyx x bxy = 41.25 x 2.22 > 1 
Our assumption is wrong. 


Hence we take, the equation (1) is taken as y on x. 





8x-10y—-66 
-10y = -8x-66 
_ —9x—6€ 
Y = 1-410 
28 XQ, $6 
Y = 318 tü 
The equation (2) is taken as regression eqn. of x in y 
40x-18t — 214 
40x=18y+214 
_ 18w+214 
ert: 
_ 18), 214 
^ — 402 ' “40 


bxy = y coefficient = -< 


18 £8 
~r = bxy x byx = — X — 
7 4n 10 


R=06 
Given variance of x=9 
Oo. =9 
Oy = 49 — 3 
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Regression coefficient of x on y 





40 oy 
(0.6) 
0.45 = 
yr 
4.2 
y 0 45 
co, —4 


Standard deviation of y=4 
SPREARMAN’S RANK CORRELATION 

In chapter on correlation and regression analysis, we 
introduced a collection of descriptive and inference techniques 
for analyzing the relation between two variables, a less formal 
non-parametric approach is often used to measure the strength of 
the relation between two variables. The spearman’s rank 
correlation co-efficient is a measure of association based on the 
ordinal feature of data. 

Among the various statistical methods based on ranks, the 
spearman's rank correlation procedure was the earliest to be 
developed. This method is simple to use and easy to apply. Also 
it has proved to be almost as powerful as its classical counter part 
the Pearson's produce moment correlation method under 
conditions favourable to the latter and even more powerful than 
the parametric method when its assumptions are violated. 

The spearman's coefficient of rank correlation R is given 


by the formula. 


6 XD? 
NW —N 





R = 1 — 
The coefficient of rank correlation is a relative measure 


which varies from -1 to +l. 
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The standard error of the rank correlation coefficient is 
given by 

The spearman rank correlation coefficient may be 
employed as a test statistic to test a hypothesis of no association 
between the two populations. We assume that pairs of 
observations (x,,y,) have been randomly selected and therefore 
the hypothesis of no association between the populations implies 
a random assignment to ranks within each sample. 

Each random assignment (for the two samples) represents 
a sample points associated with the experiment and a value of R, 
(stringer relationship between X and Y) could be calculated for 
each. Thus it is possible to calculate the probability that R, 
assumes a large positive or negative value due solely to chance 
and thereby suggests an association between populations when 
done exists. 
SPEARMAN’S LIMITATIONS OF NON-PRAMETRIC 
TESTS 

1. It should be noted that the statistical methods which 

require no assumptions about the populations for which 

we are sampling are usually less efficient (or powerful) 

than the corresponding standard techniques. Assertions 

made with equal confidence require larger samples if they 

are made without knowledge of the form of the underlying 

distribution than if they are made with such knowledge. 

It is generally true that the more we assume, the 
more we can infer from a sample. However, at the same 
time the more we assume the more. We have to be 
cautious about the applicability of the methods and the 


inferences based on them, it is disadvantages to use the 
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non-parametric methods when all the assumptions of the 
classical produces can be not and the data are measured 
either on interval on ratio scale. 

2. As the sample size gets larger, data manipulations required 
for non-parametric procedures are sometimes laborious 
unless appropriate computer software is available. 

3. A collection of tabulated critical values for a variety of 
non-parametric tests under situations dealing with small 
and large n is not readily available. 

Rank Correlation 


Spearman Rank correlation 


6 rp? 


Coefficient R = 1 — —— 
N?—N 


Where D=R-R> 
De denotes difference between the ranks for each pair of 
observations. 


Lilustration 


Find the rank correlation coefficient from the following 





6 XD? 
Ranks correlation coefficient R = 1 — BELA 
6220 
79—7 
120 


LE d 


RRA 





orm 
— 


= 1-0.3571 
= 0.6429 


Illustration 4 


Ten competitors in a musical test were ranked by the 3 


judges x,y,z in the following order 





Using rank correlation method, discuss which pair of judges has 
the nearest approach to common links of music. 
Solution 

R, = Rank by x 

R5 = Rank by y 








Rank Correlation between R; and R5 








2 
R = i a 
NÉL-N 
uH 6x206 
^ "* 4000—16 
_ 1200 
~ ^ agn 
= ]-1.212 
= -0.212 
Rank correlation between R, and R, 
x 
NP om 
N?—N 
_ 4 _ 5x214 
1nnn—1n 
= 1 — LL = 1 — 1296 = —0.296 


996 


The rank correlation coefficient between R, and R4 





5x60 
1aD5n-—TiTfh 
3260 


m 990 


= ]-0.3636 

= 0.6364 
Since the rank correlation between R1 and R3 is maximum and 
also positive, we conclude that the pair of find x and z has the 


nearest approach to common links in music. 
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Illustration 5 
Find the rank correlation coefficient for the following data 





Solution 
Let x = Marks in Tamil 
y = Marks 1n English 
R, = Rank in x 
R; = Rank in y 





QN 
Uu 
UJ) 


~ UJ 


ON 
N 
ON 


UJ 


OQ 
o 


No 


“J | 





D’=102 


Rank correlation coefficient 





6D? 
R = oS 
A> AZ 
6x102 
— 4 — ——— 
1000—10 


838 


^. 990 

— 1-0.618 

— 0.382 
Repeated Ranks 

In any two or more individuals are equal in any 

classification with respect to characteristic. A or B or if there is 
more than one item with the same value in the series then spear’s 
man formula for calculating the rank correlation coefficients 
breaks down. In this case common ranks are given for repeated 
ranks. This common rank is the average of the ranks which these 
items would have assumed if they are slightly different from each 
other and the next item will get the ranks next to the ranks 
already assumed. As a result of this, following adjustment or 


correction is made in the correlation formula. 
: 2x t 
In the correction formula, we added the factor mo to 


d^ where m is the number of item is repeated. This correction 
factor is to be added for each repeated values. 


Illustration 6 


A sample of 12 fathers and their eldest sons have the 
following data about their heights in inches. 


Father | 65 | 63 | 67 | 64 | 68 | 62 | 70 | 66 | 68 | 67 | 69 | 71_ 
Height 

Sons 
eem | 68 | 66 | 68 | 65 | 69 | 66 | 68 | 65 | 71 | 67 | es | 70 


Calculate the rank correlation coefficient 






Let x = fathers height 
y = sons height 
R, = Rank in father’s height 
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R- = Rank in Son’s height 





Correction Factors 


In x series 68 is repeated twice 


an(erz*—i} 2(27-i} 2f4-1) 2x3 i 


C.F = Ã— er A gw o——— — —M m o 
T2 TZ tA ËZ 3 


In x series 67 is repeated twice 


C.F = = LE 
12 12 2 
In y series 68 is repeated four times 
ane (are? — i 4A(49—1 4r16—1 4x15 
Qmm V TIS, M a 
i2 12 12 12 
In y series 66 1s repeated twice 
=É 2(27—1 
C.F = m(m*—1} _ 2{2*-1) 1 
12 12 z 


f 1 i 
Sum of correction factors — m + 5 +5 + + = = 7 


Rank correlation coefficient 
R=1 S| ID? +sum of correction factors] 
ae — 2 SS oF correction j actors] 
N? LN 
6[72.5--7^ 
—1— ———— 
122—122 
O 6[79.5] 
1728—12 


90 


477 
|. 1716 
= 1-0.277 


= 0.722 





SUMMARY 

Statistical methods are frequently used for taking good and 
reliable decisions in management. On the way the measures of 
central tendency and measures of dispersions are the fundamental 
platforms for getting more knowledge on correlation, regression 
and various statistical measurements. This chapter described 
about various measures of central tendency and dispersion. A 
measure of location is a value used to describe the centre of a set 
of data. The arithmetic mean 1s the most widely reported measure 
of location. 
KEYWORDS 
The keywords of this unit are 
Mean 
Median 
Mode 
Harmonic mean 
Geometric mean 
Correlation 
Rank correlation 


Regression 


Check Your Progress Questions: 


1. Calculate the rank correlation coefficient for the following data 


reson |a [= | © | P | e 


am 
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2. Compute spearman’s rank correlation coefficient for the 


following data 





3. Two judges in a beauty competition rank for the 12 entires as 


follows : 


statistics 
Maths 


5. Find the coefficient of correlation between industrial 





production and export using the following data. 


Productions | 53 | s6 | s& | se | 0 | 6o | &- 


6. The two regression equations of the variable x and y are 
x=19.93-0.87y and y = 11.64 — 0.50x find 
(i) Mean of x (11) Mean of y 


(111) Correlation between x and y 










7. Find the regression line of y and x if 





8. Find the regression lines for the following data 
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of father 
if uon 





Find the two regression and calculate the expected average 
height of the son when the height of the father 1s 164. 
Questions/ Exercise 
1. What 1s meant by measures of dispersion? 
2. Explain the characteristic of Karl pearson coefficient of 
correlation. 
3. Enlist the uses of regression. 


4. Differentiate regression and correlation. 
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UNIT 3 : PROBABILITY AND THEORITICAL 
DISTRIBUTIONS 


INTRODUCTION 
Let a coin be tossed. Nobody knows what will get: 


whether a head or tail. But it is certain that either a head or tail 
will occur. In a similar way, if a dice is thrown, we many get any 
of the faces 1, 2,3,4,5 and 6. But nobody knows which one will 
actually occur experiments of this type where the outcomes 
cannot be predicted are called ‘random’ experiments. 

The word probability or chance is used commonly in day 
to day life. For example the chances of India and Pakistan 
winning the world cup cricket, before the start of the game are 
equal (ie 50:50). It is likely that Mr. Anbu may not come for 
taking his class today. We often say that it is very probable that it 
will rain tomorrow. Probably Mr. Anu will not come to tea party 
tomorrow. All these terms, chance, likely, probable etc., convey 
the same meaning, i.e. that event is not taken to take place. In 
other words, there is an uncertainty about the happening of the 
event. 

In such of these cases we talk about chance or probability 
which 1s taken to be a quantitative measure of certainty. 

The term probability refers to the study of randomness and 
uncertain. Before going to study the mathematical definition of 


probability we will define some terms of given below. 
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UNIT OBJECTIVES 

After completing this unit, you should be able to calculate 
the probability of an event 

to apply addition theorem of probability 

to apply multiplication of probability 

to calculate the expected value 

to calculate the probability of an event using binomial 

distribution 

to calculate the probability of an event using poisson 


distributor. 


UNIT STRUCTURE 
Mathematical probability 
Calculation of probability 
Addition theorem of probability 
Multiplication of probability 
Conditional probability 
Expectation 
Binomial distribution 
Poisson distribution 
Normal distribution 


Exercise 


TRIAL AND EVENT 
Consider an experiment of throwing a coin. When tossing 


a coin, we may get a Head (H) or Tail (T). 


EXHAUSTIVE EVENTS 
The total number of possible outcomes in any trial is. 


known as exhaustive events. 
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Example 
In tossing a coin the possible outcomes are getting a head 


or tail. 


Hence we have 2 exhaustive events in throwing a coin. 


MUTUALLY EXCLUSIVE EVENTS 


Two events are said to be mutually exclusive when the 
occurrence of one affects the occurrences of the other. 
Example : 

In tossing a coin the events head or tail are mutually 
exclusive, since both tail and head cannot appear in the same 


time. 


EQUALLY LIKELY EVENTS 
Two events are said to be equally likely if one of them 
carnot be expected in preference to the other. 
Example : 
1. In tossing a coin, head or tail are equally likely events. 
2. In throwing a die, all the six faces are equally likely 


events. 


INDEPENDENT EVENTS 

Two events are said to be independent when the actual 
happening of one does not influence in any way the happening of 
the other. 
Example : In tossing a coin, the event of getting a head in the first 
toss is independent of getting a head in the second toss, third toss, 


etc. 
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Mathematical or Apriori definition of probability 

Let S be a sample space (the set of all possible outcomes 
which are assumed equally likely) and A be an event (a sub-set of 
S consisting of possible outcomes) associated with a random 
experiments Let N(S) and N(A) be the number of elements of S 
and A. Then the probability of event A occurring denoted as 
P(A), 1s defined by 
Number of cases favourable to A 


P(A) — : 
Exhaustive number of cases m S 


| n(A) 
~ nF) 
Illustration 1 : 





From a bag containing 10 red and 20 white balls, a ball is 
drawn at random. What 1s the probability that it 1s red. 
Solution : 

Total number of balls in the bag = 10020 = 30 

Number of red balls — 10 

Let A be the event of drawing a red ball 
Number of favourable cases 


1 PU = Total number of equally likely cases 


Find the probability of throwing : (a) 4 (a) an odd number 


(c) an even number with an ordinary die. 


Solution : 
Sample Space = S = {1,2,3,4,5,6} 
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a) When throwing a die there is only one way of getting 4. 
number of favourable events 


A ae number of exhansive events 
1 
6 
b) Sample space S = {1,2,3,4,5,6} 
ncS) = 6 
Let A be event of getting an odd number 
A = {1,3,5} 
n(A) = 3. 


P(A) number of favourable cases 
= Total number of equatiy izkely cases 
n(A} 


| n(S) 


MIm OI 


c) Sample space S—1(1,2,3,4,5,6? 


n(S) — 6. 
Let B be event of getting an ever number 
B = {2,4,6} 
n(B)=3 


PODE number of favourable cases 
|. Total number o f equality likely cases 
rH) 


nS) 


Nim OI 
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Illustration 3 

What is the probability that a leap year selected at random 
will contain 53 Sundays? 
Solution : A leap year consists of 366 days. 

366 days = 52 full weeks + 2 days 

= 52 x 7+ 2 days 

(52 full weeks we have 52 Sunday definitely) 

These two days may be 

1. Monday, Tuesday 

11. Tuesday, Wednesday 

111. Wednesday, Thursday 

1v. Thursday, Friday 

v. Friday, Saturday 

saturday, Sunday 

vil. Sunday, Monday 
of these 7 cases the last two cases contain Sunday and hence we 
have 2 favourable cases. 

Number of favourable cases 


+ The required probability = ———————————————————— 
k F ? Total number of equally liked cases 


«| b 


AXIOMATIC APPROACH TO PROBABILITY 

Let S be the sample space and A be an event associated 
with a random experiment. Then the probability of the event A, 
denoted by P(A) is defined as a real number satisfying the 
following axioms. 


1) O<P(A)<]1 


ii) P(S)-1 
111) If A,,AÀ5,... A, are a set of ‘n’ nutually exclusive events, 
P(A; A2 ... An) = P(A,) + P(A5) + .....* P(A4) 
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Results 

1. Probability of impossible event is zero. (ie) PC )=0 

2. Probability of the complement event A of A is given by 
P(A) = 1 — P(A} 


Illustration 4: 

A ball is drawn at random from a box containing 6 red 
balls, 4 white balls and 5 blue balls. Determine the probability 
that it is (i) Red (ii) White (iii) Blue (1v) not red. 


Solution : 


Probabil z Number of favourable cases 
Lt — ee a AAS 
robabtlity of an even M perire qun emm PA rrr eene 


G) P(a red) = = = 0.40 
(ii) P(a white) = — = 0.267 
Gi) P(a blue ball) = — = 0.333 
Gv)  P(not red) = 1 — P(a red) 
— 1— 0.40 
= 0.60 


Illustration 5 

The probability that A can solve a. Problem is 0.7 and the 
probability that B can solve that problem is 0.6. Considering that 
these two events are independent, Find the probability that the 
problem gets solved by either of them. 


Solution 
P(A) + P(A) =1 
P(success) + PCfailure) = 1 
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P(failure) = 1-P(success) 
P(A tails to solve the problem) 


1-P(A solve the 


problem) 
= 1-0.7 
= 0.3 
P (B fails to solve the problem) = 1-P(B solve the problem) 
= 1-0.6 
= 0.4 


Since the events are independent, the probability that both fail to 
solve the problem 
= 0.3 x 0.4 
= 0.12 
The probability that the problem will be solved = 1-0.12 


Illustration 6 
A bag contains 6 white, 4 red and 10 black balis. Two 


balls are drawn at random. Find the probability that they will both 
be black. 


Solution : 
Total number of balls in the bag = 6+4+10 = 20 


Two balls are drawn at random 


P(both will be black) = TA 
10x9 
1x7 

20x19 
1x2 
90 

2 

380 
2 





101 


90 2 


- 2 ™ 380 
9 

^ 38 

= 0.237 


THEOREMS OF PROBABILITY 

There are two important theorems of probability. They are 

1. The addition theorem of probability 
2. The multiplication theorem of probability 

1. Addition theorem of probability 
(a) The addition theorem states that 1f two events A and B are 
mutually exclusive the probability of the occurrence of either A 
or B is the sum of the individual probability of A and B. 

Symbolically, PCA or B) = P(A) + P(B) 

The theorem can be extended to three or more mutually 
exclusive events 

Thus PCA or B or C) = PCA) + PGB) + PCC) 
(b) For any two events A and B, the probability that either A or B 
occurs 1s given by the sum of individual probabilities of A and B 
less the probability of simultaneous occurrence of the events A 
and B. 

P(A B) = PCA) + PCB) — P(A B) 

G.e) PCA or B) = P(A) + PGB) — P(A and B) 
(c) Multiplication theorem of probability 

This theorem states that if two events A and B are 
independent, the probability that they both will occur is equal to 
the product of their individual probability. 

Symbolically, If A and B are independent 

Then PCA and B) = P(A) x P (B) 
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The theorem can be extended to three or more independent 


events. Thus 


P(A,B and C) = P(A) x P(B) x P(C) 


Illustration 

One card is drawn from a standard pack of 52. What 1s the 
probability that 1t is either a king or a queen. 
Solution 

Total number cards in a pack = 52 cards n($)=52 


Let A be the event of drawing a king n(A) — 4. 
TAI E 

nis) 52 
Since the events are mutually exclusive. 


P(A or B) = P(A) + P(B) 


P(drawin,g a king) = P(A) = 





A 4 
= 32 52 
444 

^ 82 
8 2 
^ 82 13 


Illustration $ 
The managing committee of vaishall welfare association 

formed a sub-committee of 5 person to look into electricity 
problem. Profiles of the 5 persons are 

1. Male age 40 

2. Male age 43 

3. Female age 38 

4. Female age 27 

5. Male age 65 
If a chairperson has to be selected from this, what is the 


probability that he would be either female or over 30 years 
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Solution 


P(A) = P(female) = 


U Le e i 


P(B) = P(over 30) = 


P(A and BE) = Pt female and over 30) 
1 


5 
By addition theorem of probability 


P(A or B) = P(A) + P(B) — P(A and B) 
P(female or over 30) = P(female) + P(over 30) — P (female and 
over 30) 


= 1 
Illustration 9 
A coin is tossed thrice. What is the probability of getting 2 
or more heads. 


Solution : A coin is tossed three times, then we have 


H T 
AL CX fo S P ab." ZON 
H T H T H T H T 


S—Sample space = {HHH, HHT, HTH, HTT, THH, THT, TTH, 
TTT} 


P( getting 2 heads) = 


Ol me oiu 


P( getting 3 heads) = 
104 


As A and B are mutually exclusive, 

P (getting 2 or more heads) 
= P(getting 2 heads or 3 heads) 
= P(2 heads + P (3 heads) 


Illustration 10 


A bag contains 30 balls numbered from 1 to 30. One ball 
is drawn at random. Find the probability that the number of the 
ball drawn will be multiple of (a) S or 7 (D) 3 or 7 
Solution : Total number of balls — 30 1n a bag 


(a) PCA) = P(number being multiple of 5) 
— P(5,10,15,20,25,30) 
6 
^. 30 
P(B) = P@umber being multiple of 7) 
= P(7,14,21,28) 
al 
|. 30 
Since the events are mutually exclusive, by addition theorem of 


probability 


P(A or B) = P (multiple of 5 or multiple of 7) 
= P(A) + P(B) 
d 4 
= zo 30 
6+4 
^. an 
10 
^— 30 


(b)P(A) = P(number being multiple of 3) 


— P(3,6,9,12,15,18,21,24,27,30) 
10 


30 
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P(B) = P(number being multiple of 7j 


— P(7,14,21,28) 
4 


20 
Since Z1 is a multiple of 3 wells as 7 

P(A and B) = P(multiple of 3 and multiple of 7) 
1 
20 


By addition theorem of probabihty, 
P (multiple of 3 or multiple of 7) 
= P(A or B) 
= P(A) + P(B) — P(A and B) 
10 4 1 
— so ' 30. 30 
10 +4— 1 
||. 80 
13 


20 
Illustration 11 


The probability that a boy will get a scholarship is 0.9 and 


that a girl will get is 0.8. What is the probability that atleast one 
of them will get the scholarship? 
Solution : 
P(A) = P(a boy will get a scholarship) = 0.9 
P(B) = P(a girl will get a scholarship) = 0.8 
P(A and B) = P(A) x P(B) 
= 0.9 x 0.8 
= 0.72 
By addition theorem of probability 
P(A or B) = P(A) + P(B) - P(A and B) 
= 0.9 + 0.8 — 0.72 
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= 1.7 — 0.72 
| — 0.98 
Illustration 12 
A class consists of 80 students, 25 of them are girls and 55 
boys, 10 of them are rich and remaining poor, 20 of them are fair 
complexioned what is the probability of selecting a fair 
complexioned rich girl’? 


Solution : Number of students in a class = 80 


P(A) = P(selecting a fair complexioned person) = = 
P(B) = P(selecting a rich person) = m 
PC) = P(selecting a girl) = = 


Since the events are independent, by multiplication 
theorem of probability P(Selecting a fair, complexioned and rich 
girl) 

= P(A and B and C) 

= P(A) x PCB) x P(C) 


20 10 £25 


m So? 30°" 20 

= 0.0098 
Illustration 13 

A university has to select and examiner from a list of 50 
persons, 20 of them women and 30 men, 10 of them knowing 
Hindi and 40 or not, 15 of them being teachers and the reaming 
35 not. What is the probability of the university selecting a Hindi 


knowing women Teacher? 


Solution 
Total number of person in a list = O 


P(A) = P(Selecting a Hindu knowing candidate) 
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P(C) = P (selecting a teacher) 
15 
^. 50 
Since the events are independent, by multiplication theorem of 
probability 
P(selecting Hindi knowing woman teacher) 
= P(A and B and C) 
= P(A) x P(B) x P(C) 


10 20 15 
^ 50 SO 50 
3 
^ 128 
— 0.024 


Illustration 14 

A man wants to marry a girl having qualities; where 
complexion — the probability of such a girl is one in twenty; 
handsome dowry-the probability of getting this 1s one is fifty; 
Westernized manners — the probability here is one in hundred. 
Find out the probability of his getting married to such a girl when 


the possession of these three attributes are independents. 


Solution 


1 

P(A) — P(a girl with white complexion) — EN 0.05 
: : 1 

P(B) = Pla girl with handsome dowry) = sg 7 0-02 


P(A) — P(a girl with westernised manners) — igg ^ 2:91 
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Since the events are independent, by multiplication theorem of 
probability 
P(getting all these qualities) 
= P(A and B and C) 
= P(A) x P(B) x P(C) 
= 0.05 x 0.02 x 0.01 
= 0.00001 
Illustration 15 
The probability that machine A will be performing an 
usual function in 5 years time is 1/4 while the probability of 
machine B will still be operating usefully at the end of the same 
period is 1/3. Find the probability that both machines will be 
performing an usual function. 


Solution 


PCA) = P(machine A operating usefully) = 


U) | bed ote | pt 


P(A) = P(muachtne B operating usefully) = 


By multiplication theorem of probability, P(both A and B will 
operate usefully) 
= P(A and B) 


= P(A)x P(B) 


» er | 
= = y 
4 3 


i 


12 
Illustration 16 


A bag contains 8 white and 10 black balls. Two balls are 
drawn is succession. What is the probability that first is white and 


second 1s black. 
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Solution 
Total number of balls in a bag = 8+10=18 


g 
P(A) = P(drawing a white ball} = — 
G 
PCB) = P(drawing a black ball) = = 


Since the events are independent, by multiplication theorem of 


probability 
P(A and B) = P(A) x P(B) 
_ 2 Q2 
^ i12" 18 
_ 20 
^— 1&8 
— 0.247 


Illustration 17 
A problem is mathematics is given to 3 students A,B,C 


; 1*1 ; 
whose chances of solving it are S A. respectively. What 1s the 


probability that the problem will be solved? 
Solution | 


P(A will solve the problem) — - 

P(A will not solve the problem) — 1 — = == 
P(E will solve the problem) = = 

P(B will not solve the problem) = 1 — - = 
P(B will solve the problem) = = 


P(P will not solve the problem) = 1 — 5 = : 


P(all the three will not solve the problem) =  xIx- 
1 


a 
P(all the three will solve the problem) 
= 1-P(all the three will not solve the problem) 
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illustration 18 
Give two independent events A and B such that P(A)=0.2 
and P(B)=0.5 Find 
(i) P(A and B) (ii) P(A or B) (iii) P(A and Not B) 
Given P(A) = 0.2 
PCB) = 0.5 
Since A and B are independent 
(1) P (A and B) = P(A) x P(B) 
= 0.2 x 0.5 
= 0.1 
(11) P (A or B) = P(A) + PCB) — P(A and B) 
= 0.2 x 0.5-0.1 
= 0.7-0.1 
= 0.6 
(111) P (not B) = 1-P(B) 
= 1-0.5 
= 0.5 
P (A and not B) = P(A) x P(not B) 
= 0.2 x 0.5 
— 0.1 


Definition | 
The conditional probability of A given that B is defined as 


"OE 


2(b) multiplication rule of probability 


The probability of simultaneous occurrence of two events 


A and B denoted by P(A B) and defined by 
111 


P(A B) = P(A) x P(B/A) 


Illustration A 
From a bag containing 3 red and 4 black balls two bails 


are drawn in succession, without replacement. Find the 
probability that both the balls are (i) red (ii) black (iii) of the 


same colour. 


Solution 
i) Let A= {first ball drawn 1s red} 
B = {second ball drawn 1s red} 
Then A B = {both balls are red} 
Now P(A) — = Now P(B/A) = = 


— P CARB) = PCA)P (=) = x= 


11) Let C = {first ball drawn is black} 
D = {second ball drawn 1s black} 
Then C B= {both balls are black} 


z$ 3 
Now P(C) m Now P(D /€) ^ . 
P(C BO =O P(D/C) 
a .2. = 
|») 7".8 7 


131) When balls are o the same colour either both are red or both 
are black and the two cases are mutually exclusive. 

=P[(A B)or(C Dj] 

= P(A B)+ P(C D) 
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Hilustration 20 
A bag contains 5 white and 3 black balls. Two balls are 
drawn at random one after the other without replacement. Find 
the probability that both balls drawn are black. 
Solution 
Probability of drawing a black ball in the first attempt is 
P(A) — 3 3 


5+3 8 





Probability of drawing the second ball given that the first 
ball drawn is black 


E 2 2 
PG) = car} | 
lhe probability that both balls drawn are black is given 
by 


P(AcB) = PCA).P(B/A) 
z = 3 


= — x- = — 


e^7 28 


MATHEMATICAL EXPECTATION 

The concept of mathematical expectation is of great 
importance in statistical work. The mathematical expectation 
(also called the expected value) of a random variable is the 
weighted arithmetic mean of the variable, the eights used to find 
the mathematical expectation are all the respective probabilities 


of the values that the variable can possibly assume. 


Definition 

gf x denote a discrete random variable which can assume 
the values of x;,,x5,...x,, with respective probabilities P,,P5, ....P, 
where P|+P2+....P,=1, the mathematical expectation of x denoted 
by E(x) 1s defined as 

ECX) = PX, + P2X2 + .... + P4Xn 
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Illustration 21 
A petrol pump proprietor sells on an average Rs. 70,000 


worth of petrol on rainy days and an average of Rs. 90,000 on 
clear days. Statistics from the meteorological department show 
that the probability is 0.7 for clear weather and 0.3 for rainy 
weather on coming Sunday. Find the expected value of petrol sale 
on coming Sunday. 
By data, 
X, = 70,000, P, = 0.3 
X- = 90000, P; = 0.7 
Expected value of petrol sale on coming Sunday is 
ECX) = PiXı + P2X%2 
= 0.3 x 70,000 + 0.7 x 90,000 
= 21,000 + 63,000 
= Rs. 84,000 


Illustration 22 


A dealer in refrigerator estimates from his past experience 


the probabilities of his selling refrigerator in a day. These are as 


follows 
No. of refrigerators sold 1n : O 1 2 3 4 5 
a day ° 
Probability  : 0.03 0.20 0.23 0.25 0.12 
0.10 0.07 
Find the mean number of refrigerators sold in a day. 
By data, Xı=0, X-2=1, X3=2, X4=3, X;=4, X6—5, X-=6 
P,—0.03 P2=0.20 P3=0.2 P-4—0.25 P;=0.12 P.=0.10 
P;—0.07 


= OQ + 0.2 + 0.46 + 0.75 + 0.48 + 0.5 + 0.42 
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= 2.81 
= 3. 
Illustration 23 
The probability that a man fishing at a particular place will 


catch 1,2,3,4 fish is 0.4, 0.13, O.2 and O.1 respectively. What is 
the expected number of fish caught? 


By data, X;-1, X3—2, X3=3, 
X4-4 

P,—0.4 P5—0.3 P,—0.2 
P4—0.1 


Expected number of fish caught would be given by 
EC) = P,X, + P2X2 + P3X3 + P4X14 
= 0.4x1 + 0.3x2 + 0.2x3 + O.1x4 
= 0.4 + 0.6 + 0.6 + 0.4 
= 2 
Illustration 24 
A salesman wants to know the average number of units he 
sells per sales call. He check his past sales records and comes up 
with the following probabilities 
Sales in units : O 1 2 3 4 5 
Probability  : 0.15 0.20 0.10 0.05 0.30 0.20 
What is the average (expected) number of units he sells per sales 
call. 
Solution : 
By data, X,=0 X2=1 X3=2 X4=3 X>5=4 
Xg—5 
P,—O.15 P;—0.20 P3=0.00 P4=0.05 
P,—0.30 P6=0.2 | 
E(x) = PX, + PoX. + P3X%3t P4X4 + P5X5+ PoXs 
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— 0.15x0-0.20 x 140.10 x 2+0.05x3 + 0.30x4 + 
0.2x5 

= 0 + 0.2 + 0.2 + 0.15 + 1.2 + 1.0 

= 2.75 


Thus he would expect to sell 2.75 or 3 units on each sales 


Call. 


Illustration 25 
A consignment is offered to two firms X and Y for Rs. 


1,00,000. The following table shows the probability at which the 


firms will be able to sell 1t at different prices 


Selling price (Rs.) 
Probability 
80,000 90,000 1,05,000 1,10,000 
x 


Which firm X or Y will be more inclined towards the offer 







Solution : Firm X 
X,—80,000  2X5,—9,000 X3—1,05,000 
X4=1,10,000 
P,—O.2 P5—0.3 P3=0.4 P4—0.1 


E(X) =P,X, + P2X; + P3X3 + P4X4 

= 0.2 x 80,000 + 0.3 x 90,000 + 0.4 x 1,05,000 + 0.1 x 
1,10,000 

= 16,000 + 27,000 + 42,000 + 11,000 = 96,000 
Firm Y X,-80,000 X.=90,000 X3~1,05,000 
X4—1,10,000 


P,—0.25 P5—O.2 P3;=0.5 P4—0.05 
ECY) = P,X, + PoX, + P3X3 + P4X, 
= 0.25 x 80,000 + 0.2 x 40,000 + 0.5 x 1,05,000 + 0.05 x 
1,10,000 
= 20000 + 18000 + 52500 + 5500 = 96,000 
Since the expectation is the same, both the firms would be 
equally inclined towards the offer. 
Illustration 26 
There are three alternative proposals before a businessman 
to start a new project. 
Proposal A: Profit of Rs. 5 Lakhs with a probability of 0.6 ora 
loss of 
Rs. 80,000 with a probability of 0.4. 
Proposal B : Profit of Rs. 10 Lakhs with a probability of 0.4 ora 
loss of 2 Lakhs with a probability of 0.6 
Proposal C : Profit of Rs. 4.5 Lakhs with a probability of 0.8 or a 
loss of Rs. 50,000 with a probability of 0.2. 
If he wants to maximize the profits and minimize the loss, which 
proposal should be prefer. 
Solution 


Mathematical expectation of each of the proposal ECX) = 
r 1-81 +P2X> 


Proposal A: X,=5,00,000 X, = -80,000 
P,=0.6 P5 — 0.4 
Expected value = PX, + PorX2 


= 0.6 x 5,00,000 x 0.4 x (-80,000) 
= 3,00,000 — 32,000 
= 2,68,000 


Proposal B: 


X; = 10,00,000 X, = -2,00,000 
P, — 0.4 P5—O.6 
Expected value E(X) = P,X, + P2X2 


— 0.4 x 10,00,000 + 0.6 x (-2,00,000) 
— 4,00,000 — 1,20,000 = 2,80,000 


Proposal C 
X1 = 4,500,000 X2 = -50,000 
(Loss) 
P1=0.8 P2 = 0.2 


Expected value EX = P,X, + P2X2 
= 0.8 x 4,50,000 + 0.2 x (-50,000) 
= 3,60,000 — 10,000 
= 3,50,000 
Since expected value is heighest in the case of proposal C, 
hence he should prefer proposal C. 
Baye’s Theorem 
Statement : Let A;, A>,....A, be n mutually exclusive events and 


B be any event then 
P(ADP (B/A: 
PAM) = y ECL FO/AD 
Proof 
Let A;,A2,....A, be n mutually exclusive events and B be 
any event. | 
A, A2 .... An-S where S-sample space and hence 
B=B S 
= BCA; Ao... A.) 
—(B A) (B As3)..(B Ay) 
=(A B) (A2 B)... (An B) 
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Now since A;, A»,....A, are mutually exclusive so are the events 
(B A), B Ag,...(B A). 
P(B) —P(A, B)+P(A2 B)+....+P(A, B) 


= Y P(A; ^B) 


j=1 


= > P(A,)P(B/ Aj) 


j=i 

Thus for a fixed J, 
ib (2) _ PAB) 
B P(H) 
2L P(AjJPGI/A) 
LTP(A,P(B/A;) 

Note : 
(1) Let Aj, A; be 2 mutually exclusive events and B be any event 


then 
: PCADP(B /A1) 
i) P(A, |B) = : 
(i) PL I ) P(A, PCB / A, )--PCAS)PCB// A.) 
P(A DPB fA) 


ii) P = 
(1) (Az Ig) PCA, JPCB/ÀA,O)FPCA-)PU(B/ AS) 


b) Let Aı, A2, A; be 3 mutually exclusive events and B be any 


event then 
PKA PCBS A4) 
i I 
(i) P{A, |B) P(A, ) PCB / Ay )+PUA2) P(BsAg)+P(Ag) PCB/Ag) 
- PUALIPUE fA) 
UUENA- IBI ee a 
COUPLAND IS LE se nt ee 


eee B = ——————————————————————— 
(ii) P(A, |B) P(A, )P(B/A,)+P(A,)P(B /An)+P (Ag) PCB / Aq) 


Illustration - 

In a bolt factory machines A,B and C manufacture 
respectively 25%, 35% and 40% of the total of their output 5,4 
and 2 percent are defective bolts. A bolt is drawn at random from 
the product and is found to be defective. What is the probability 


that it was manufactured by machines A,B and C? 
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Solution 


Let Al the event of drawing a bold produced by first 


machine 
A2 = the event of drawing a bolt produced by second 


machine 

A3 = The event of drawing a bold produced by third 
machine 

B = the event of drawing a defective bolts. 


Then from the first information 


Prior probability 
25 
PCA,) = 25% = a 0.25 
35 
PLA) = 35% = car 0.35 
P(A) = 40% = — = 0.40 


Conditional probability 
P(B/A4) = 5% — -— = 0.05 


4 
P(B J A2) = 4% = —— = 0.04 


P(B/Aa) = 2% = — = 0.02 


10 

Joint probability 

PA.) P(B/A,) = 0.25x0.05 = 0.0125 

P(A.) P(B /Az) = 0.35x0.04 = 0.0140 

P(A.) P(B/A4) = 0.40x0.02 = 0.0080 
By Baye's theorem 
Posterior Probability 

P(A,)P(B/A,) 
P(A, PCB /Az1) + P (As) P(B/Az) + PCA)P(B / A4) 
- D.01z5 
n.0129$--ni1320--nangsn 


0.0125 
^. 0.0345 


PCA,IB) = 
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= (0.362 
PCA,)P(B A>) 
PCA,DP(B/A,)-- P(A2)P(B/Az) + PCADP(B /A4) 
3.0140 
D.0179--040140-niüaaaan 


G.014¢ 
6.0345 


= 0.406 





PCAS)P(B/A3) 
P(A, IJP(B/A,) + P(AZL)P(B/ A4) + PLAS)P(B / Aq) 
0.008 
0.0125+00140+0.0080 
0.008 
0.0345 


= 0.232 


Event Prior Conditional Joint Posterior 
Probabili Probabili Probabili Probabili 

















Illustration 


Assume that a factory has two machines. Past records 





show that machine 1 produces 30% of the items of output and 
machine 2 produces 70% of the items. Further 5% of the items 
produced by machine 1 were defective and only 1% produced by 
machine 2 were defective. If a defective item is drawn at random, 
what 1s the probability that the defective item was produced by 


machine 1 or machine 2? 
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Solution 
Let A, = the event of drawing an item produced by 


machine 1 
A, = the event of drawing an item produced by 


machine 2 
B = the event of drawing a defective item produced 


either by 
machine 1 or machine 2 
Then from the first information, 
Prior probability 
P(A,) = 30% = 0.30 
P(A2) = 70% = 0.70 
Conditional probability 
P(B/A,) = 5% = 0.05 
P(B/A2) = 1% = 0.01 
Joint Probability 
P(A, )P(B/A,) = 0.30 x 0.05 = 0.015 
P(A2)P(B/A2) = 0.70 x 0.01 = 0.007 


By Baye’s theorem, 


Posterior probability 
P(A,)P(B/A,) 
P(A, )P(B/A4) + P(Az)P(B/Az) 
0.015 | 
0.015--0.007 


O.01E 
D.n72 


— 0.682 


P(A,/B) — 





_ P(A,)P(B/Az) 
P(A, )P(B/A,) + P(A) P(B/A>) 
0.007 
0.015--n.^5n7 


P(A2/B) 
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Probability that the defective item was produced by machine 1 


or machine 2 


= P(A,/CB)+P(A2/B) 
= 0.682 + 0.318 
= ] 
THORETICAL DISTRIBUTIONS 
The students will be introduced in this section to the 
techniques of developing discrete and continuous probability 
distributions and its applications. We may think of a probability 
distribution. Where just like distributing the total frequency to 
different class intervals, the total probability (i.e, one) is 
distributed to different mass points in case of discrete random 
variable or to different class intervals in case of continuous 
random variable. Such a probability distribution, since such a 
distribution exists only in theory. We need study theoretical 
distribution for the following factors. 
(a) An observed frequency distribution in many a case, 
may be regarded a sample i.e a representative part of a 
large, unknown, boundealess universe on population 
and we may be interested to know the form of such 


distribution. By fitting a theoretical probability 
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distribution so say the lamps produced by a 
manufacturer, it may be possible for the manufacturer 
to specify the length of life of the lamps produced by 
him upto a reasonable degree of accuracy. By studying 
the effect of a particular type missiles necessary to 
destroy an army position. By knowing the distribution 
of smokers, a social activist may warm the people of 
locality about the nuisance of active and passive 
smoking and so on. 

(b) Theoretical probability distribution may be profitably 

employed to make short term projections for the future. 

(c) Statistical analysis is possible only on the basis of 

theoretical probability distribution. Setting confidence 
limits or testing statistical hypothesis about population 
parameter(s) is based on the probability distribution of 
the population under consideration. 

A probability distribution also possesses all the 
characteristic of observed distribution. Again a probability 
distribution may be either a discrete probability distribution or 
continuous probability distribution depending on the random 
variable under study. The two important discrete probability 
distributions are: 

a) Binomial distribution 
b) Poisson distribution 
Some important continuous probability distributions are 
a) Normal distribution 
b) Chi-square distribution 
c) Student's t-distribution 
d) F-distribution 
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Binomial Distribution 
Definition 

A random variable X is said to follow binomial 
distribution if it assumes only non-negative values and its 
probability mass function is given by 

P(X—x)-nGP' gq’, x-0,1,2,...n and p+q=1 
Notation 

X-B(n,p), n abd P are parameters of Binomial distribution 
1.e., X follows Binomial distribution with parameters n and p. 
Properties of Binomial distribution 

1. Mean of the binomial distribution 1s no. 
Variance of the binomial distribution is np 


2 
3. Standard deviation of the binomial distribution is ,/npg 
4 





: _ q-—p 
Moment coefficient of skewness y, = ape 
à Z l—624q 
5. Moment coefficient of kurtosis y ,_= 
npa 
Note : 
D EPX =x) = once pg" T =@mipy=-1 since 


p+q=1 
(1) If N is the total frequency, the expected frequencies of 
0,1,2,...n success are the successive terms of N(q+p)n. 
Ge) f«)=N.P(X=x) 
G11) PCX<1) = P(X=0) 
Gv) PCX<1) = PCX=0) + PCX=1) © 
(v) P(X<2) = P(X-0) + P(X-1) 
(vii PCX<2) = PCX=0) + PCX=1) + PCX=2) 
(vii) P(X-3)—P(X-0) + P(X-1) + P(X-—2) 
(vill) PCX<3) = P(X-0) + P(X-1) + PCX=3) 
(x)  P(X-1)- 1-PCX<1) 
(1.e.) PCX=1) = 1-P(X-0) 
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(x) PCX-2) = 1-P(X<2) 
(i.e.) PCX>2) = 1-[P(X-0) + P(X-1)j 
(xi)  P(X23)- 1-PCX<3) 
(i.e.) PCX>3) = 1-[PCX=0) + P(X-1) + P(X-2)] 


Illustration 27 
In a large consignment of electric bulbs 10% are defective. 


A random sample of 20 is taken for inspection. Find the 


probability that 
1) All are good bulbs 
ii) At most there are 3 defective bulbs 
iii) Exactly there are three defective bulbs 
Solution 


Given, n=number of samples = 20 


P= 10% = —=0.1 
q=1-P 

= 1-0.1 

=0.9 


Then 
P(X=x) = nGP* q'^, x=0,1,2,...n) 
P(X-x) = 20C,.(0.1Y*. (0.99°°* 
(1) P (all are good bulbs) 
= P (none are defective) 
= P (X=0) 
= 20Co. (0.1)°, (0.q)*° 
= 20Co x (0.1)? x (0.9y?? 
—1x1*x0.1216 
— 0.1216 


(11) P (at most there are 3 defectives) 
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= P(X<3) 

= P(X=0) + P(X-1) + P(X=2) + P(X-3) 

= 20C, (0.1) (09? + 20c' (0.1) 

(0.9)'7+20C,(0.1)7(0.9)!8 
ET 

20C;3(0.1)° 
(0.9)!7 

= 0.1215 + 0.27 + 0.285 + 0.19 

— 0.8666 


(111) P(exactly 3 defective bulbs) 

= P(X-—3) 

= 20C; (0.1y? (0.9)!” 

= 0.19 
Illustration 28 

A machine manufacturing screws is known to produce 5% 

defective. In a random sample of 15 screws, what is the 
probability that there are (i) exactly two defectives (ii) at least 
two defectives? 
Solution 


Give n = number of samples = 15 


5 
p = 5% = -pg = 0.05 


q = l-p 
= 1-0.05 
= 0.95 
Then binomial distribution is 
P(X=x) = nGP* q'*, x=0,1,2,...n) 
P(X-x) = 15C,(0.05)* (0.95)'>* 
(1) P (exactly two detectives) 
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= P(X=2) 
= 15C, (0.05)2 (0.95) 
= 0.1347 
(11) P (at least two defectives) 
= P(X>2) 
= ]-P(X-2) 
= 1-[P(X=0) + P(X-1)] 
= 1-[15C,(0.05)° (0.95)? + 15C,(0.05)' (0.95) 
= ]-[0.4632+0.3657] 
=1-[ 1 


Illustration 29 
Assuming that half the population are consumers of rice, 


ži 


so that the chance of an individual being a consumer is % and 
assume that 100 investigators each take 10 individuals to see 
whether these are consumers. How many investigators would you 
expect to report that 3 people or less were consumers. 

Solution : 


Given, n = 10, 


—i-0.5 
i 
N=10-0 


Thus probability mass function binomial distribution is 
P(X=x) = nC,p* q 
P(X-x) = 10C,(0.5)*(0.5)!°* 
P (3 people or less were consumers) 
= P(X<3) 
= P (X=0) + PCX=1) + PCX=2) + P(X-3) 
= 10C,(0.05)° (0.5)? + 10C, (0.5)! (0.5) 
+10C,(0.5)°(0.5)° 
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-10C3(0.5)*(0.5)? 


= 10C4(0.5)'^-10C,(0.5)!?2-10C4(0.5)!?*--10C4(0. 5)!? 
= (0.5)? [10C97-10C,7-10C27-10C] 
= (0.5)'°[1+10+45+120] 
= (0.5)? x 176 = 0.1718 
Expected number of investigators be reported as three are less 
were consumers 
= N x P(X<3) 
= 100 x 0.1718 
= 171.8 
= 172 


Illustration 30 
6 dice are thrown 729 times. How many times do you 
expect atleast three dice to show a five or six? 
Solution 
P = Probability of getting 5 or 6 with one die 
= P (getting 5) + P (getting 6) 


— 1 a 
Ei 
| i+} 2 1 
|^ 4 A R 
q — ]-P 
_4_2 
3 
| 3-1 
T a 
_ 2 
ES 


Thus probability mass function binomial distribution is given by 
P[X-x] = nC,.Pq' 
PIX =x)] = ec. (2) G) 
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P (atleast three dice showing five or six) 
= PCXS3) 
= P(X=3) + P(X-4) + PCX=5) + P(X=6) 


= 0c, (2)° @' «ea (2)' Q^ + 0058)" Q' 
«ec, O O 


23 22 i 
= 20x Ta + 15x zz t 6x7 T 1x ga 


L60--60--12--1 

— gg 
_ 233 
15 


For 729 times, the expected number of times atleast 3 dice 


showing five or six. 


= NxP(X>3) 


= 729 X=, 
= 233 times 
Illustration 31 
If the probability of defective bolts is 0.1, find (a) the 
mean and standard deviation for the distribution of defective bolts 
in a total of 500 and (6) the moment coefficient of skewness and 
kurtosis of the distribution. 
Solution 
Given n=500 
P=0.1 
Q=1-p=1-0.1 = 0.9 
a) Mean of the binomial distribution 
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= 500 x O.I 
= 50 


Standard deviation of the binomial distribution 
= np 
= 4/500 x 0.1 x O.9 
— 6.71 


b) Moment coefficient of skewness 








K "on a p ae 0.9-0.4 

r1 nP ¥500x0.120.5 
| Og 
O ATI 
= 0.119 


Since , 1s more than zero the distribution is positively skewed. 
However, skewness is very moderate 


Moment coefficient of Kurtosis 


: " i—copq 
2 "Pg 


||. 1—6(0.1)(0.9) 
500 x 0.1 0.5 
0.46 


44.93 


= 0.01 


Since 321s positive the distribution 1s platykurtic. 


POISSON DISRIBUTION 

In the previous section we have discussed a discrete 
probability distribution Binomial distribution. Now we shall 
discuss another discrete probability distribution called poisson 
distribution. The characteristic of the poisson distribution are 

(1) The occurrence of the events are independent 

(11) The number of trials is large 

(iii) The probability of occurrence is very small 
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Examples of poisson distribution are 
(1) The demand for a product 
(11) The occurrence of accident in a factory 
(iii) The arrival pattern in a departmental store 


(v) The arrival of calls of a switch board 


Definition 
A random variable X is said to follow poisson distribution, 


if it assumes only non-negative values and its probability mass 


function is given by 
e Cyn 


art 





P(X = x)= x = 0,1,2 .... 
This distribution has only one parameter m and m-—mean = np. 
Properties of Poisson distribution 
1. Mean of the Poisson distribution = np 
Variance of the Poisson distribution = np 


2 
3. Standard deviation of the Poisson distribution = ,/np 
4 


"P i 
Skewness of the Poisson distribution — 2 


"m 1 
5. Kurtosis of the Poisson distribution = 3 + 


Illustration 32 
If 5% of the items produced by a factory are defective, use 
Poisson distribution to find the probability that in a sample of 100 


items 

(1) None is defective 

(ii) 5 items will defective (Given e°=0.007) 
Solution 


Given, P = Probability of producing a defective item 
= 5% | 


5 
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Thus, 


P(X = x)= 


G) 


G1) 


and n = 100 
m-—np = 100 x 0.05 =5 


probability mass function Poisson distribution 
a 
e ?(53* 
P (none is defective) = P (X=0) 
e rgy 
m n! 
0.007.x1 
o 1 
— 0.007 
P (5 items will be defective) 
= P(X=5) 
e 5(5j5 


5t 
D.D007 x 312E 
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= 0.1823 


Illustration 33 


is 


A car hire firm has two cars which is hires out day by day. 


The number of demands for a car on each day is disbuted as a 


Poisson distribution with mean 1.5. Calculate (1) the proportion 


of days on which neither car is used and (11) the proportion of 


days on which some demand is refused (e! ^—0.2231) 


Solution 


Given that M=1.5 


Thus the probability mass function of Poisson distribution 


J= e 1571.59” 
RE x! 
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(i.e) POY = 


(i) PReither car is used) 
= PCX-—0) 


g 9651,55? 
ot 
O.2231x Ë 
i 


Hence the proportion of days that neither car is used 
= N.P(X-0) 
— 365 x 0.2231 
— 81.4 days 


= 0.2231 


(11) P (some demand 1s refused) 
= P(X>2) 
= 1-[PCX<2] 
= 1-[P(X=0)+ PCX=1) + PCX=2)] 


1 E C3-P(1.83)^ | € COCQ1i.S)^ e *"¢1.5)* 
Hbc 0! + it 2! 





—1.5 E 1.5 aa 
= Le ila: ee 
p E + i + = 


= 1-0.2231 [3.625] 

= 1-0.8087 

= 0.1913 
Hence the proportion of days that some demand is refused 

= NxP (X>2) 

= 365 x 0.1913 

= 69.83 days 
Illustration 34 

BHEL, Trichy, employing a large number of workers finds 

that over a period of time, the average absentee rate is three 
workers per shift. Calculate the probability that on a given shift 
G) exactly 3 workers will be absent (ii) more than 2 workers will 
be absent [e °=0.0497] 


Solution 
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Given that m = average rate = 3 
Thus probability mass function of Poisson distribution is 


e a7 
P(X = x) = ———_ 
xi 


—3 AX 
P(x =x) = 5 UM 


x1 
Q) P (exactly 3 workers) = P (X=3) 
e 3x37” 
3! 
0.0497 x 27 


= — = 0.224 
A 


GD) p (more than 2 workers are absent) 
= P(X>2) 
= 1-P(X<2) 
= ]-[PCX-0) + P(X-1) + PCX=2)] 


= 1— fe’, ae , so) 





E oF 3 
= ]-[0.045 + 0.149 + 0.223] 
= 1-0.417 
= 0.583 


illustration 36 
A book contains 100 misprints randomly throughout its 
100 pages. What is the probability that a page observed at 


random contains at least 2 misprints? [e1—0.368] 


Solution 


Given that 


= ps = 1 = The average nurnber of misprints per page 


Thus, the probability mass function of Poisson distribution 
1S 


PX = x)= 
x! 
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P(X =x)= EE 
x! 
P(at least 2 misprints) 
— P(X-2) 
= 1-P(X<2) 
= 1-[P(x-0) + P(x-1)] 
= E =o 4 dicm 





0! i! 
p tat e ixi 
oS ot a t 
= ]-[2e'!] 
= 1-0.736 
= 0.264 


Illustration 37 
It is given that 3% of electric bulbs manufactured by a 


company are defective. Using the Poisson approximation, find 
the probability that a sample of 100 bulbs will contains (1) No 
defective (11) Exactly one defective [e^ —0.05] 
Solutions 
Give that P=3% = 3/100 = 0.03 
n=100 
m — np = 100 x 0.03 =3 
Thus, probability mass function of Poisson distribution is 





— WE 
P(X =x) = Z ni 
95 AX 
POC = x) = EO 
(i) P(Üno defective) = PCX=0) 
u e ?(a)? 
BEEN 
—3 
= -~=0.05 


(11) P(one defective) = P(X-1) 
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Illustration 38 
The mean of the Poisson distribution is 2.25. Find mean, 


variance, standard deviation, skewness and kurtosis of the 
Poissoin distribution. 
Given that mean m=2.25 = np 
G) Mean of the Poisson distribution m=np=2.25 
G1) Variance of the Poisson distribution = np = 2.25 


(111) Standard deviation of the Poisson distribution 


= {np 
= 42.25 
= 1:5 
(1v) Skewness of the Poisson distribution 
1 1 X 


(v) Kurtosis of the Poisson distribution 
1 


= 3 + = 340.444 — 3.444 
eS 


NORMAL DISTRIBUTION 


Definition: A random variable X is said to follow normal 
distribution with mean p and variance ^, if its probability 


density function is given by 





1 
(x) = d o .— 90 < X — o0, > 0,—o0o0 — n — oo 
d an ZT 


Where u~mean of the random variable x 
—standard deviation of the normal distribution 


=the constant 3.14. 
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Standard Normal Distribution 


Let X is a normal variable with mean p and S.D. . Define 


Z = LE then Z is called standard normal variable then 
n 





4 _ a 
fF) = oven 2d 


Properties of normal distribution : 

1. The normal curve is bell shaped 
In normal distribution, mean — median — mode 
It is symmetrical around the distribution mean 
If ranges between -œ to œ 


The total area under normal curve 1s 1 


UB 2 a E. 


The area under the curve between two points is the 
probability that a variable which is normally distributes 
will assure that value between those two points. 
Illustration 39 
The weekly wages of 1000 workmen are normally 

distributed around a mean of Rs. 70 witha S.D of Rs. 5. Find the 
number of workers whose weekly wages will be 

G) Between Rs. 69 and Rs. 72 

(11) Less than Rs. 69 

Gii) More than Rs. 72 
Solution 
Given that, 
u = mean of the normal distribution = 70 


= standard deviation of the normal distribution = 5 


When X=69, then z = LE M = : = —(.2 
When X=72 then z = — = eS — 0.4 
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(1) P (weekly wages between Rs. 69 and Rs. 72) 
= P(69<X<72) 
= P(-0.2 <Z<0.4) 
= P(-0.2<Z<0) + P(O<Z<0.4) 
= P(O<Z<0.02) + P(O<Z<0.4) 
= 0.0793 + 0.1554 
= 0.2347 
Out of 1000 workmen, the number of workers whose wages lies 
between Rs. 69 and Rs. 72 
=N x P (69 < X < 72) 
= 1000 x 0.2347 
= 234.7 
= 235 | 
(11) P (weekly wages less than Rs. 69) 
= P(X<69) | | 
= P(Z<-0.2) : 
= 0.5-P(-0.2<Z<0) | 
= 0.5-P(0<Z<0.2) | 
= 0.5-0.0793 02 | 
= 0.4207 Sa NPA mmn tete inda 


méme vm RA, reer RANA rri mieu arma AR a i 





Out of 1000 workmen, the number of workers whose wages less 
than Rs.69 

= NXP(X<69) 

= 1000 x 0.4207 | 

= 420.7 | 

~ 421 
Gii) PGweekly wages more than Rs. 72) | : 0.4 

= P(X>2) 

= P(Z>0.4) 

= 0.5 — P(<Z<0.4) 
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= 0.5 — 0.1554 = 0.3346 
Out of 1000 workmen, the number of workers whose weekly 
wages are greater than 72 
= N.P(Z>4) 
= 1000 x 0.3446 
= 344.6 
= 345 
Illustration 40 
The customer accounts at a certain departmental store 
have an average balance of Rs. 480 and a standard deviation of 
Rs. 160. Assuming that the account balances are normally 
distributed. 
(1) What proportion of the accounts is over Rs. 600? 
(11) What proportion of the accounts is between Rs. 400 and 
Rs. 600 
(31) What proportion of the accounts is between Rs. 240 
and Rs. 360 
Solution 
Given that mean 1-480, standard deviation —160 
(i) When X=600, the standard normal variate 


"- 
s 
600 — 480 
^ A 160 
= 0.75 


P(X>600) = P(Z>0.75) 


= 0.5-P(0<Z<0.75) i Ree OR: le MRC RY 


= 0.5 — 0.2734 
= 0.2266 
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Hence 22.66% of the accounts have a balance in onless of Rs. 
600 


(11) Where X=400 


Then standard normal variable z = ŽE 
: c 
_ 400-480 yp 
160 
When X = 600, 
Then standard normal variable Z = — 
_ 600-480 _ 5. 





P(400<X<600) = P(-0.5<Z<0.75) 
= P(-0.5<Z<0) + P(O<Z<0.75) 
= P(O<Z<0.5) + P(OQ<Z<0.75) 
= 0.1915 + 0.2734 = 0.4649 
Hence 46.49% of the accounts have an average balance between 
Rs. 400 and Rs. 600. 
(11) When X=240, then 





Standard normal variable 7 = E 
| 240—480 
| 160 
i Z240 = 0 75 -1.5 -0.75 0 
TAN LLL i 
When X = 260 then 
Standard normal variate 
g .£-* 
£1 
_ 360—480 
^ A 160 
=-1.5 
P(240<X<360) = P(-0.75<Z<-1.5) 


= P(1.5<Z<0) — P(-0.75<Z<0) 
= P(0zZ«1.5) — P(O<Z<0.75) 
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= 0.4332 — 0.2734 
= 0.1598 
Hence 15.08% of the accounts have an average balance 
between Rs. 240 and Rs. 360 
Illustration 41 
It is known that from the past experience that the number 
of telephone call made daily in a certain city between 10 am and 
11 am has a mean of 352 and standard deviation of 31. What 
percentage of times will there be more than 400 telephone calls 
made in this locality between 10 am and 11 am? 
Given that mean u = 352 
and standard deviation —31 
When X = 400 then 


Standard normal variate 
X-u 
LE) 


|. 400—352 0 ue 
35 


= 1.55 
The probability that there will be more than 400 calls 
= P(X>400) 
= P(Z>1.55) 
= 0.5-P(O0<Z<1.55) 
= 0.5 — 0.4394 (using standard normal table) 
= 0.606 | 
Hence the percentage of days on which the number of calls will 
exceed 400 is 60.6% 
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Illustration 43 


In a normal distribution 31% of the items are under 45 and 
8% are over 64. Find the mean and standard deviation of the 


distribution. 
Solution 

Let the mean of the normal distribution be u and standard 
deviation of the normal distribution be 

The are lying to the left of the ordinate at X=45 is 
31%=0.31. 


The are lying to the right of the ordinates up to the mean is 
0.5-0.31=0.19 


1995 





ordinate 


31% 
8% 


| 
| 
| 
| 
| 
| 
| 
| 


The value of Z corresponding to the area 0.19 is 0.5 
nearly. 
SI = —Q.5 
45 -+ [2 
G 
45-u = -0.5 
45=-0.5 +u 


-0.5 +u=45 (1) 


= —0.5 
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Area to the left of the ordinate at X=64 is 0.5-0.08 = 0.42 and 


hence value of Z corresponding to this area is 1.4 nearly. 
X—H 


6-4— gL 


= 1.4 


= 1.4 





64-1 = 1.4 

64—1.4 +u 

1.4 +u=64 (2) 
Solving (1) and (2) 

1.4 +u=64 

-0.5 +u=4.5 

1.9 =19 


19 
1.9 


g = = 10 
=10 
Substituting | —10 in equation (2) 
1.4 +u = 64 
1.4 x 10 + u = 64 
u = 64-14 
u = 50 
Mean = u=50 
Standard deviation =10 
Illustration 44 
In a normal distribution, 7% of the items are under 35 and 
87% are under 63. What are the mean and standard deviation of 
the distribution. 
Let mean of normal distribution be u standard deviation of 


normal distribution be 
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43% 
39% 


ordinate 


71% 


11% 


x=35 x=83 | 
v d 

The are lying to the left of the ordinate at x—35 is 0.07. 

The corresponding value of Z is negative. 

The area lying to the right of the orinates at x—63 upto 
mean 1s 

0.5-0.07 = 0.43 
The value of Z corresponding to the area 0.43 is 1.4757 


Ge) =" = —1.4757 
35— $ 


cx 


35-n—-1.4757 
35 = -1.4757 +n 





= —1.4757 


The area lying to the left of the ordinate at X—63 upto the 


mean 1s 0.39. 
The value of Z corresponding to the area 0.39 is 1.2263. 
T-E — 1.2263 
ao 


63—44 


63-u=-1.2263 

63 = -1.2263 +n (2) 
Solving (1) and (2) we get 

Mean u = 50.288 

S.D = 10.36 


= — 1.2263 
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Uniform Distribution | 
A random variable ‘x’ is said to have a uniform 
distribution, if its probability density function is given by 
EL <x < B 
fo) = = ý 


Q other wise 


Properties of uniform distribution 


: . . b 
1. Mean of the uniform distribution = — 
. "Nm b—a)? 
2. Variance of the uniform distribution = — 


Moment generating function of uniform distribution 
E Ea = E f 
Illustration 

If X is uniformly distributed random variables with mean 


1 and variance =, find P(X<0) 


Solution 
For uniform distribution, we know that 


Variance = —— = 


EN > - l12ax- 
—»(b — a) : 


(b-a) = 16 
bea=+4 ween (2) 





Solving (1) and (2) we get 


a—-1, b=3 

l 001 

m f x) B B—a 
i 


3—(—1^ 








i. 
= Zin -]<x<3 
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- PIX < 0) = i fixjdx 


"E 
—ilo0— (—1)1- 2 


Illustration 

Subway trains on a certain line run every half hour 
between midnight and six in the morning. What is the probability 
that a man entering the station at a random time during this 
period will have to wait atleast twenty minutes. 
Solution 

Let x be the random variable which denotes the waiting 
time for the next time. 

Assume that a man arrives at the station at random, the 


random variable X is distributed uniformly in (0,30) with p.d.f 
+ 


roD = fea? =< X < pb 


O otherwise 


i 
——— = < 
fG»)- {30-0°° -— X < 30 
O otherwise 


t 9o9< xX¥ < 30 
== .30—tc 


Q otherwise 
P(atleast 20 minutes) = PCX>20) 
= L. Fixjdx 


30 t 
aur nhe 





ax 
i 
= 55 o 
i 
= — {80 — 20) 


=> a) 


| 
^B 
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SUMMARY 
In this unit we come to know that the calculations of probability 


of an event, addition theorem of probability, multiplication 


theorem of probability, 


and  Baye's theorem, binomial 


distribution, Poisson distribution and normal distribution. 


KEYWORDS 


1. 
2. 


Event 

Independent events 

Probability of an event 

Addition theorem of probability 
Multiplication theorem of probability 
Baye’s theorem 

Probability mas function 

Probability density function 


Binomial distribution 


10. Poisson distribution 
11. Normal distribution 


KEY TO CHECK YOUR PROGRESS: 


1. 


From a bag containing three red and two black balls, two 
balls are drawn at random. Find the probability they are 


the same colour. [And : =|. 


If from a pack of cards a single card is drawn. What is the 
16 


probability that it is either a spade or a king. [and : =] 
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Find the probability drawing a queen, a king and a Jockey 
in that order from a pack of cards in three consecutive 
draws, the cards drawn not been replaced. Ans = 0.00048 
A class consists of 80 student, 20 of them are girls and 55 
boys, 10 of them are rich and remaining poor 20 of them 
are fair complexioned. What is the probability of a fair 
complexioned rich girl? [Ans : 0.0098] 
The probability that a boy will get a scholarship is 0.9 and 
that a girl will get is 0.8 what is the probability that atleast 
one of them will get the scholarship? [Ans : 0.98] 
The incidence of a certain disease is such that on the 
average 20% of workers suffer from it. If 10 works are 
selected at random, find the probability that (i) exactly 2 
workers suffer from the disease (ii) not more than 2 
workers suffer from the disease AM (1) 0.302 (ii) 0.678 
In a town 10 accidents took place in a span of 50 days. 
Assuming that the number of accidents per day follows the 
Poisson distribution. Find the probability that there will be 
three or more accidents 1n a day [Ans : 0.001] 
An aptitude test for selecting officers in a bank was 
conducted on 1,000 candidates, the average score is 42 and 
the standard deviation of scores in 24. Assuming normal 
distribution for the scores find 
a) The number of candidates whose score exceeds 
58. 
b) The number of candidates whose score lies 

between 30 and 66. 

Ans a) 252 b) 533 
A sample of 100 day dry battery cells tested to find length 


of the life produced the following results : 
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Mean = 12 hours, =3 hours | 
Assuming the data to be normally distributed, what 
percentage of battery cells are expected to have life 

1) More than 15 Hours 

11) Less than 6 hours and 

111) Between 10 and 14 hours 
Ans :a) 15.87% b) 2.28% c) 
49.74% 

10. At a certain examination 10% of the students who 
appeared for the paper in statistics got less than 30 marks 
and 97% of the student got less than 62 marks. Assuming 
the distribution is normal. Find the mean and the standard 


deviation of the distribution. 
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UNIT IV : INDEX NUMBERS AND TIME SERIES 


Introduction 


Historically, the first index was constructed in 1764 to 
compare the Italian price index in 1750 with the level in 1500. 
Though originally developed for measuring the effect of change 
in prices, index numbers have today become one of the most 
widely used statistical devices and there is hardly and field where 
they are not used. Newspapers headline the fact that prices are 
going up or down, that industrial production is rising or falling, 
that imports are increasing or decreasing, that crimes are rising in 
a particular period as disclosed by index numbers. They are used 
to feel the pulse of the economy and they have come to be used 
as indicators of inflationary or deflationary tendencies. In 
fact, they are described barometers of economic activity. (i.e) if 
one wants to get an idea as to what is happening to an economy, 
he should look to important indices like the index number of 
industrial production, agricultural productions, business activity 
etc. 

Some prominent definitions of index numbers are given 
below. | 

1. Index numbers are devices for measuring differences in 
the magnitude of a group of related variables". - 

Croxton & Cowden 

2. An index number is a statistical measure designed to show 
changes in a variable or group of related variables with 
respect to time, geographic location or other characteristic 
such as 1ncome, profession, etc." 


- Spiegel 
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3. Generally speaking, index numbers measure the size or 
magnitude of some object at a particular point in time as 
percentages of some base or reference object in the past. 

- Berensen & Levine 
Unit objectives 
The objectives of the unit are as follows 
To understand the various types of index numbers 
To construct consumer price index 
To know about the various tests of perfection of index 
numbers. 
Unit structure 
Introduction 
Definition 
Unit objectives 
Unit structure 
Characteristic of index numbers 
Uses of index numbers 
Types of index numbers 
Business index number 
Sensex, Nifty 
Problem related to construction of index numbers 
Weighted price index(WPI) 
Tests of consistency of index numbers 
Chain base Index and Fixed base index 
Construction of consumer price index(CPI) 
Aggregate Expenditure method 
Family Budget method 


Production index. 
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Limitations of index numbers 


Exercise 


Characteristic of Index Numbers 


l. 


Index numbers are specialized average helps comparing 
the changes in variables which are is different unit. 
Index numbers are expressed in percentages to show the 
extent of change without using percentage sign (70) 
Relative variations are measured with the help of index 
numbers. Index numbers are for comparison they compare 


changes taking place over time or between places and like 


categories. 


Uses of Index Numbers 


1. 


Types 


The index numbers are used as end results in. many 
situations. For example index numbers are often cited in 
new reports and acting as general indicators of economic 
condition. 

Index numbers are used as part of an intermediate 
computation to understand other information latter. 

Sales indices were used to modify and improve the 
estimates of future. 

Consumer price index is used to determine the “real” 
buying power of money. 

of Index Numbers 


There are various types of index numbers. 


The most important types are 


1. 
Ze 
3. 


Price index numbers 
Quantity index numbers 


Value index numbers. 
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Price Index Numbers 
Price Index is a normalized average of prices for a given 


class of goods or services in a given region, during a given 
interval of time. It is a statistic designed to help in compare how 
these prices, taken as a whole, differ between time periods or 
geographical locations price index have several potential uses. 
For particularly broad indices the index can said to measure the 
economy’s price level or a cost of living. More narrow price 
indices can help producers with business plans and pricing. 
Sometimes, they can be useful in helping to guide investment”. 
Quantity index Numbers 

Quantity Index numbers study the changes in the volume 
of goods produced or consumed. For instance industrial 
production, agricultural production, import, export etc., they are 
useful and helpful to study the output in an economy. 


Value Index Numbers 


Value Index numbers compare the total value of a certain 
period with the total value of the base period. Here the total value 
is equal to the price of each, multiplied by the quantity. For 


instance, indices of profits, sales, inventories etc., 


Business Index Number 


Today index numbers are one must widely used statistical 
tools in Business. They are used to the pulse of economy and 
come to used as indicate as of inflation and deflationary 
Tendency Application of index numbers in business is called 
Business Index Number. It is integration of business practices 
and index numbers for the purpose facilitating forward planning 


and decision making by the management. 
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Application index number in business 


1) Index numbers act as economic parameters. 
11) Index numbers help in policy formulation. 
111) It reveals trends and Tendencies. 


1v) It helps to measure purchasing power. 


INDICES USED IN INDIAN CAPITAL MARKET: 


Indices are used to give information about the price 
movements of securities in the stock markets. Stock Market 
Indices are meant to capture the overall behavior of equity 
market. A share index is a compilation of several shares prices 
into a single number. A stock market index is created by 
selecting a group of stocks that are representatives of the whole 
market or a specified sector or segment of the market. An Index 
is calculated with reference to a base period and a base index 
value. The movement in an index represents the overall increase 
or decrease in prices of shares included in the index. However, it 
does not mean that all scrips have moved in a particular direction. 
The basic idea of an index is that one is able to get an 1dea or a 
broad picture about how the market or a particular segment has 
performed by looking at the index rather entering the complicated 
process of looking at each scrip. Stock market indices are useful 
for a variety of reasons. Some of reasons of maintaining share 


indices are : 


They provide a historical comparison of returns on money 
invested in the stock market against other forms of 
investments. 

They can be used as a standard against which to compare 


the performance of an equity fund. 
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It is an indicator of the performance of the overall 
economy. 

Stock indices reflect highly upto date information. 
Indices based Index Funds, Index Futures, Index Options 


play an important role in financial investments and risk 


management. 


Share indices reflect the changing expectations of the stock 
market about India’s corporate sector. When the index goes up, it 
is because the stock market thinks that the prospective growth 1n 
the future will be better than previously thought. When prospects 
of dividends in the future become pessimistic, the index drops. 
The index gives, an instant reading about how the stock market 
perceives the future of India’s corporate sector. 

Indices are calculated as weighted average and each share 
(security) is given weight proportional to its market 
capitalization. Suppose, an index consists of three shares A, B 
and C and total capitalization of these 3 securities is Rs. 350 
crores (A — Rs. 200 crores, B — Rs. 100 crores and C- Rs. 50 


crores), then weights of these securities in the index are 4:2: 1. 


‘Types of Indices: 
Different types of indices are available in practice. 


These can be broadly classified into two groups: 


1) Broad Market Index : 
These indices consist of all the large stocks of the 


capital market. A broad market index works as a 


benchmark of the entire capital market and also of 
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entire economy. S & P CNX NIFTY, S & P CNX 500 
and SENSEX are example of broad market indices. 


11) Specialised Sector Indices : 
‘These are also known as Sectoral Indices. These 


are the indices calculated in respect of shares of 
companies operating is a particular sector of the 
economy. BSE — PSU is a specialized index dealing 
with the price movement in the shares of PSUs. 
Recently, in July 2007, BSE has launched ‘BSE 
Reality Index’, comprising of 11 companies engaged in 
real estate development business. 

In India, there are number of indices being 
calculated by BSE, NSE, Research Houses, Business 
Newspapers. NSE is calculating 9 major and 16 other 
indices on the basis of share prices movement on NSE. 
Similarly, BSE is also calculation 15 different types of 
indices. However, among all, the most important has 
been the SENSEX, the price sensitivity index of BSE 
Ltd. and the NIFTY. 


SENSEX: The Barometer of Indian Capital 
Miarket. 

SENSEX is based on the closing prices of 30 constituent 
shares which are periodically selected on the basis of large, liquid 


and representative companies. 


BSE was established as the first stock exchange in India in 1875. 
Since then, it has occupied the place of prime relevance in the 
Indian Capital Market. In 1986, BSE came out with a share 
index SENSEX, which has since become the barometer of 


Capital Market in India. SENSEX has the base year 1978 — 79 
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for which the index number is taken as 100. SENSEX is based 
on the closing prices of 30 constituent shares which are 
periodically selected on the basis of large, liquid and 
representative companies. There has been a regular upward and 
downward movement in the SENSEX to reflect the price 
behavior of the 30 shares. Initially, SENSEX was calculated on 
the basis of total market capitalization of a particular company. 
However, w.e.f. Sept. 1, 2003, SENSEX has been converted into 
a free float index. Now, the weights of different shares are 
assigned on the basis of market capitalization of freely trading 
shares. The shares held by promoters are kept outside the index 
calculation. Now, SENSEX is calculated using the “Free Float 
Market Capitalization" method in which the level of index at any 
particular point of time reflects the market value of free floats of 
30 shares. The market value of total shares is multiplied by the 
free float factor to find out the free float market capitalization. 
During trading hours, the current prices of index scrips are used 
by the SENSEX mechanism to calculate the value of SENSEX 
every 15 seconds. The closing SENSEX value for the day is 
computed on the basis of weighted average of all trades in all the 
30 shares during last 30 minutes of the trading session. Weight 
assigned to a particular share is adjusted for Right, Bonus or 
Public issue made by the company. As on May 31, 2011, 30 
shares included in SENSEX, their weights, free- float factors and 


the B factor are given in Table. 
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Table: SENSEX composition, Weights, Free — 
float factor and f as on 31-5-13 


Weightage | Beta | Co-efficient of | Free 
Company (%o) in Values 
SENSEX 






Detetmination Float 


(R?) Factor 


aS On 


Reliance 


. | Infosys Tech 


0.49 


0.55 


ITC Ltd. 


Larsen & 
Toubro 





Housing 1.14 0.52 
Devpt. | 
. | HDFC Bank 1.03 0.52 0.80 
Ltd 
1.20 0.46 0.45 


a B 
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Mahindra & 
Mahindra 






State Bank of 
omis 
LING LLLA 

10. 


Hindalco 1.79 .50 
intone || 
Hindustan 2.22 59 
— n | 
Sterilite 1.75 1.31 
lud 
68 .77 


- 
QO 


17 
19 


O 
O 
O 


0.47 


8 
© 


.- 
* 
* 
* 


0.34 
0.55 


O 


wI N| m 


NJ 
IN 


ro L 

] Maruti 1.20 0.85 0.27 
Suzuki 

2 


EE 


27 REL 0.56 1.16 
Infrastructure i ft | 
Jaiprakash O. 1 
Associates 
29. | DLF Limited 0.52 


30. Reliance 0.44 1.16 0.21 
Com. Ltd. 


SENSEX 1.00 


.53 
.20 
45 
.13 
.24 
.49 


O 
O 
O0 


28 


Beta = Co-variance (SENSEX, Stock)/Variance (SENSEX) 


R? = (Correlation)? 
Average Daily Volatility = One standard deviation of daily 
returns of individual stock price for last one year 


Returns — Variation in the stock price over last one year 
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7 


© 
© 


© 
Uu 
o 


4 


o 


0.45 
0.25 
0.70 
0.70 
0.50 


0.65 


0.50 


0.55 


0.55 


0.25 
0.35 


S & P CNX NIFTY: 
It is also called NIFTY. NSE introduced NIFTY in 


association with CRISIL on November 3, 1995 when it started 
with base 1,000. It is built out of 50 largest and highly liquid 
stocks. Over the years, NIFTY had emerged as an important 
indicator of the movement of prices of important shares on the 
National Stock Exchange. With effect from July 2009, NIFTY is 


calculated based on Free — Float Capitalization methodology. 


S & P CNX Defty: 


5 & P CNX Defty is S & P CNX Nifty, measured in US $ 
terms. Ifthe S & P CNX Nifty rises by 2%, it means that the 
Indian stock market rose by 2% measured in rupees. If the S & P 
CNX Defty rises by 2%, it means that the Indian stock market 
rose by 2%, measured in US dollars. The S & P CNX Defty is 
calculated on real time basis. Data for the S & P CNX Nifty and 
the dollar — rupee rate is absorbed in real time, and used to 
calculate the S & P CNX Defty in real time. When there is 
currency volatility, the S & P CNX Defty is an ideal device for a 


foreign investor to know where he stands, even intraday. 


S & P CNX 500: 


The S & P CNX 500 is India’s first broad based benchmark 
of the Indian capital market based on the prices of 500 shares. 
The S & P CNX 500 represents about 90% of total market 
capitalization. 500 companies are disaggregated into 72 
industries. Industry weightages in the index dynamically reflect 
the industry weightages in the market. For example, if the 
banking sector has a 5 % weightage among the universe of stocks 


on the NSE, banking stocks in the index would have an 
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approximate representation of 5% in the index. The S & P CNX 
500 is a market capitalization weighted index. The base date for 
the index is the calendar year 1994 with the base index value 
being 1000. Companies in the index are selected based on their 
market cap italizati on, industry representation, trading interest and 
financial performance. The index is calculated and disseminated 


on real — time basis, 


S & P CNX NIFTY Junior: 


S & P CNX Nifty Junior is an index built out of the next 50 
(after those included in NIFTY) large, liquid stocks in India. It 
may be useful to think of the S & P CNX NIFTY and the CN X 
NIFTY Junior as making up the 100 most liquid stocks in India. 
CNX NIFTY Junior can be viewed as an incubator where young 
growth stocks are found. As with the S & P CNX NIFTY, stocks 
in the CNX NIFTY Junior are filtered for liquidity, so they are 
the most liquid of the stocks excluded from the S & P CNX 
NIFTY. The maintenance of the S & P CNX NIFTY and the 
CNX NIFTY Junior are synchronized so that the two indices will 
always be disjoint sets, i.e., a stock will never appear in both 


indices at the same time. 


CNX Midcap 200: 


CNX Midcap 200 is computed using market capitalization 
weighted method, wherein the level of the index reflects the total 
market value of all the stocks in the index relative to a particular 
base period. The method also takes into account continuous 
changes in the index and importantly corporate actions such as 


stock splits, rights, etc. without affecting the index value. The 
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base period is calendar year 1994, indexed to a value of 1000. 
Therefore, the value of CNX Midcap 200 at any point of time 
reflects the aggregate of the market capitalization of the index 
constituents relative to the base value of 1000. There are 200 


companies in this index. 


Ss & P CNX NIFTY Shariah. 


There are some investors who want to invest as per 
Shariah Guidelines (their religious beliefs), under which 
investment in companies which are engaged in following 
activities is not allowed : Alcohol, Gambling, Financials, 
Advertising and Media, Tobacco, Forward trading in Gold and 


Silver, Pork. So, there was felt a need for Shariah comply index. 


NSE took the lead and w.e.f. February 19,2008, S & P 
CNX NIFTY Shariah Index was launched. For the purpose of 
this index, the 50 companies comprising NIFTY are screened for 
Shariah compliances. Those companies fulfilling Shariah 
compliances form the S & P CNX NIFTY Shariah Index. 
Simultaneously, NSE also launched S & P CNX 500 Shariah 
Index for which Shariah compliant companies are screen from S 


& P CNX 500 Index. 


BSE has also launched BSE TASIS Shariah 50 Index on 
December 27, 2010. All these Shariah indices are based on Free 
— Float Market Capitalisation weighted methodology. 


Methods of constructing Index Numbers 
A large number of formula have been devised for 
constructing index numbers. Broadly speaking, they can be 


rouged under two heads. 
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The following chart illustrates the various methods. 


Index Numbers 
Unweighte Weighted 
Simple sis sh Weighted hia, qnte 
A t Average of OTOR Average of 
DEUS Relatives B8108 Relatives 


Unweighted Price Index Number 


1. Simple Aggregative Method 
This is simples method of constructing the index Numbers. 


The prices of the different commodities of the current year are 
added and the total is divided by the sum of the prices of the base 
commodity and multiplied by 100. Symbolically 





Where P, — Current year price 


Po — base year price 


Illustration 1 
From the following data constructed an index for 2012 


taking 2011 as base. 
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Simple Aggregative Method 


=P 
Price index Number Po, = -p X100 
G 


— 390.400 = 120 
RGN 


This means that as compared to 2011, in 2012 there is a 
net increase in the prices of commodities included in the index to 
the extent of 20%. 

2. Simple average of price relative method 

In this method, the price relative of each item is calculated 
separately and then averaged. 

A price relative is the price of the current year expressed 
as a percentage of the price of the base year. 

1) When arithmetic mean is used for averaging the price relatives, 


the formula for computing the 1ndex 1s 


Where N refers to the number of items. 
ii) When geometric mean is used for overaging the price relatives 
the formula for computing the index 1s 
zi oggixi0 1 


Poi ~ antilog | x 
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Illustration 2 
Compute a price index for the following by an average of 


price relative method by using both arithmetic mean and 


Geometric mean 





Solution 


— for price index 





= 122.92 


b) Geometric mean of the price relative index number 


Poi = antilog (- XlogP ) 
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= antilog (32:6) 
— 121.7 
Weighted Price Index Number(WPLI) 
Weighted index number is of two types. They are 
(1) Weighted aggregate price index number 


Gi) Weighted average of price relatives. 


Weighted Aggregative Indices 

These indices are of the simple aggregative type with the 
fundamental difference that weights are assigned to the various 
items included in various methods of assigning weights and 
consequently a large number of formulae for constructing index 
number have been devised of which some of the important 
methods are | 

1. Laspeyar's method 

Paasche's Method 
Dorbish and Bowley's method 
Fisher's ideal method 
Marshall-Edge worth method 
Kelly's method, and- 
Walseh's method 


d M M a 


1. Ley peyre’s price index number 
In this method, the base year quantities are taker as 
weights : symbolically, 


P= Pido 
i Po Qo 


x100 
2. Paasche’s Price Index Number 


In this method, the current year quantities are taken as 


weights; symbolically 
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Pi, 
Poi = *P.q 


3. Dorbish and Bowley's price index number 


This is an index number got by arithmetic mean of 


x100 





Laspeyre’s and Paache’s price index numbers symbolically. 


- (ze, LPiqg 
2 \EPoqdo == Pods 


4. Fisher's ideal price index number 


— 192) <100 


Fisher’s ideal index number is given by geometric mean 


by Laspeyre’s and Paache’s price index numbers symbolically 


YZPiqog 2P q: 
Poo Poq: 





x100 





Poi = 


5. Marshall’s — Edgeworth’s index number 

In this method also both current year as well as base year 
prices and quantities are considered the formula for constructing 
the index 1s 


p = (28 »Piqi 
= =Pp9dg Poq: 





)x100 


6. Kelly's price index number 
Kelly's index number uses quantities of some period as 


weights. This weight is kept constant for all periods, denoted by 





q. 
=P 

Py. = —4x100 
2Pn 


7. Walsch's price index number 
Walsch’s price index number is given by 


»Ps(Go + q1) 


Pa = )x100 
- »Pol{do+ gi) 
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Illustration 3 
Construct the index number of price from the following 


data by applying 


a) Laypeyre's method 

b) Paasche's method 

c) Bowley's method 

d) Fisher's ideal method and 

e) Marshall — Edgeworth method 


A 





Solution 


Calculation of various indices 











2011 
V0 do P; di 
| A | 2 | 8 | 4 | 6 ,32]| 16 | 24 | 12 | 
| B | 5 | 10 | 6 | 5 | œ | 50 | 30 | 25 | 
| D | 2 | 19 | 2 | 3 | 38 | 38 | 26 | 26 | 
P14o P:qi Po Wo 
—200 | =160 | —130 | =103 





a) Laypeyre's Price index 
LP 
Poi — iaio +100 
=P oqo 
| 200 
| 1600 


= 125 





x100 
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b) Paache’s price index 


= 126.21 


c) Bowley’s price index number 


zZP4Go, <Pigi 
—  £Pgag =PoGi 
Poi = k x100 


200 3120 


— 2460 ip x10 0 
z 


l1I.25-T i.26Z4 
= — 2, x1 00 


a 2-^x100 











— 125.6 
d) Fisher's ideal Index number 


= 41.25 x 1.262x 100 


— 4/1.578x100 
— 1.2561 x 100 
— 125.61 


e) Marshall — Edge worth method 
 fEPaiügtXPiai 
Poi = EPpd5--XPgd4 ) x100 
E 7 Cer)" 


160--10: 
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Weighted average of price relatives 
In practice, the Laspeyre’s price index is usually 
calculated using price relatives. For this method we have to use 
the expenditure in the base year as weights. Symbolically 
Weighted average of price relatives 


P 
Pa = > Where P = =*x100 
g 


V=Poq0 


Using Geometric mean 
ZViogP 
Pg, = antilog (= >E ee} 
Where P = “2 x100 
Po 


V —poqo 
Illustration 4 
Compute price index by applying weighted average of 


price relative method 


2012 2013 
Commodi | pree | Quatiy | prc | quami 


wine | 25 | 235 | 3 | 3 -— 
Le [oam | 9 [oe | 8 — 










Construction of price index 











a i 
Commodity = px100| V-Pud 
a DI m pes [me 


EPV=14340 
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Weighted average of price relative 


Po; = — Where p = =*x100 
V —Poqo 
_ 1434C 
/—— 1313.5C 
— 126.34 


Weighted aggregate quantity index numbers 

Price index numbers measure and permit comparison of 
the price of certain goods, the quantity index numbers permit 
comparison of the physical quantities of goods produced, 
consumed or distributed. The most common type of quantity 


index is that of quantity produced. 


G) Laspeyre's quantity index number 


2.04 Pg 
= ——x100 
Qo: ZOgPo 


(i1) Paache's quantity index number 


Qio MN X100 


flop 


Gii)  Bowley's quantity index number 


Süaapao 4 EgiPp 3 


E a dx j 
Ou = aopa Gabi x 100 
2 


Gv) Fisher’s ideal quantity index number 


¿qa p C 
Qoi = TT xP,——— iPa — x100 
zde Po To Pa 


(v) 


201Po + Xp, 
Qo; = = (=P T tdp: = 20 Pt) x100 
uim 20oPo ES dopi 
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Illustration 5 
Compute quantity index by 
a) Laspeyre’s method 
b) Paasche’s method 
c) Bowley's method 
d) Fisher ideal method 
e) Marshall Edge worth method 


Price 2012 2013 
Commodity P, Total Value Total value 
Poqo P14 


4 
9 
5 
32 
40 






© 
© 
þh 
NO 
NO — 

QN 


ON | > 


e 
I 
pà 


D 





0 
— m» [2 
(oc [me] » [3- 
( » |v 
i 





© 


Solution 


Construction of various quantity index 


324 400 360 


GoPo qıpı qopı 
= 1020 | = 1320 | = 1160 





a) Vayperye’s quantity index number 
201Dp.. 
= —— x100 
Qoi fub. 


— 1954100 = 113.70 
£0206 
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b) Paache’s quantity index number 


Ooi = 2a Pa —— € 100 
ZügP4 


— 122032100 = 113.80 
160 


c) Bowley's quantity index number 


i zq- p 2046 
Qo —— Il 9. x100 
= M«LK1gPo 


Eo £3220 
= 2220) x100 
E 1260 





d) Fisher ideal quantity Index number 


zq P Iq P 
Qoi = I> x — — x100 
$üpoPo  L-GoPi 


[1160 4320 


4 1ozo^ 4160 


= 11370 


x100 








e) Marshall’s — Edgeworth’s quantity index number 


Pa4--Edj4eg 
Cipi — (RISE x10 0 
zn Patopnd 


d160--13z2 
= = )x100 
AOoz20-riieoed]g 


- == x100 
1890 





Value index 
Value index number 1s easy to calculate, here, value 1s the 


product of price and quantity. The value of index V 1s the sum of 
the value of index V is the sum of the value of the current year 1s 


divided by sum of the value of the base year. 


Walue index V — (==) xi 00 
Og 
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Tests of consistency of index numbers 
Several formulae have been suggested for constructing 
index numbers and the problem is that of selecting the most 
appropriate one in a given situation. The following tests are 
suggested for choosing an appropriate index. 
1. Unit Test 
2. Time Reversal Test 
3. Factor Reversal Test 
4. Circular Test 
Unit Test 
The unit test requires that the formula for constructing an 
index should be independent of the unit in which or for which, 
prices and quantities are quoted. Except for the simple 
(unweighted) aggregate index all other formulae discussed in this 


chapter satisfy this test. 


Time Reversal Test 

It is a test to determine whether a given method will work 
both ways in time forward and backward. The test provides that 
the formula for calculating the index number should be such that 
two ratios; the current on the base and the base on the current 
should multiply into unit. It other words, the two should be 
reciprocals of each other. 
Symbolically 

Po, x Pio = 1 


Fisher’s ideal formula satisfies the time reversal test. 


Paama {rPido,, =Pidi |TPodi. Pode 
a1*10 Zpoüo Dod: 4} =P1 41 Ep qda 


2P4do ipat pota IP odo 


= AD eee 


Zpgdo =Po91 EPadı EPop4da 
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= V1 = 1 
Ge) Po, x Pio = 1 
The test is not satisfied by Laspeyre’s method and the paache’s 


method as can be seen below when Laypeyres method is used 


=Pidp 
ZPgag 


=P yaa - - . 
= ] to O 
Pio TP.q. changing time (i.e) O to 1 and 





ZPa4Odgg — ¿poña 


Pa XP = sec 
01 10 Speedo Ipd 
When Paache’s method used 
¿pala 
Poi = - 
Fola 
Dodo 
P — M 4 
iu 22p4üo 
rpij4d Epon , 
SP 904 Paio 
f=p.q 2P 14 
Note: Bj, = [2 x——— x100 


al ¿Poño =Pod: 


S a ee e 
Ppi Lp P oao 

P,, = IZ x 2100 
2P490, 26.4190 


Changing time (1.e) O to 1 and 1 to O. 
Factor Reversal test 

Another basic test is that the formula for index number 
ought to permit to interchanging the prices and quantities without 
giving inconsistent results. (1e) the two results multiplied together 
should give the true value ratio. A good index number should 
satisfy not only the time reversal test but also the factor reversal 
test. A good index number should allow time reversibility 
interchange of the base year and the current year, without giving 


inconsistent results. 
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2. pa Qu 
2. po Go 


The factor reversal test is satisfied only by the Fiser's ideal 
index. 


PoiXQo1 = 


For, 








Lpa4 = 
Es "EN aU x E aada 
2Ped5  -Pp53 


Changing P to q and q to p 








— IUüa4DPog  L-g4p4 
Qoi — 









ÉZp141d34  -di4Pg B 2404P4 
2PoGo0  £Poda4  LUgPo  XüsgP4 


4. Circular Test 

Another test of the adequacy of the index, number formula 
is what is known as ‘circular test’. If in the use of interest 
attaches not merely to a comparison of two years but to the 
measurement of price changes over a period of years, it is 
frequently desirable to shift the base. A formula 1s said to meet 
this test if for example, 2013 index with 2010 as the base 1s 200 
and the 2010 index with 2007 as the base must be 400. Clearly, 
the desirability of this property is that it enables us to adjust the 
index values from period to period without referring each time to 
the original base. A test of this shift ability of base is called the 
circular test. 

Symbolically, if there are three indices Poi, Piz and P-o the 


circular test will be satisfied if 


177 


Po. X Piz X 29; = 1 


Illustration 6 
Construction Fisher ideal index from the following data 


and show that it satisfies time reversal test and factor reversal 





Pidi Poqo 
=2172 | =1914 


Fisher’s ideal index 


[paa EP 
py, — [Pat xe Pa 100 
Polo {Pods 


7 
- J cae £172 X100 








1774 1914 
— 1.135 x 100 
= 113.5 
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Time reversal Test 
Time reversal test is satisfied when Po; x Pio = 1 


2p4 Zp = = 
PoiXPio = 1p 3 4453 x Poia x odo 
2Piodp 2P 95 P4, 27 p4üo 


2015 2172 19014 1774 
eom X x x 
i774 329134 22372 20:15 


=I 














= 1 


Hence time reversal test is satisfied by the given data. 


Factor reversal test 


pala 


Factor reversal test is satisfied when Poi*Qoi = nm: 
gu 


YDa4u 2p4q 24a p ida P 
XUI gr a= x a D x 1# i 


P.. = 
Ol |Epgüg  *Pg38.  ZügPgo  XdgPi 


Í2015 2172 1914 2172 
x x x 
1774 ' i014 1774 2045 

















| if2172X47 

v = 

£172 3 =Pids 
1774 Tpoq,5 


— ¿Pta 
7 FoixQoi — L5 do 


Hence factor reversal test is satisfied. 
Illustration 7 

Calculate fisher's ideal index from the following data and 
prove that it satisfied both the time reversal test and íactor 


reversal test. 
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raa Espen | Tice. | Espen 











= 4/1.2726x100 
= 1.128x100 
= 112.8 


Time Reversal test 


Time reversal test 1s satisfied when P»9;xPi9 = 1 


*Pido 2 Pidi , 8 (Epoq: 2.po do 


10 


Xpodo “Epod : JEma 2P do 





Poi = 






*Pido | 2D: qi „Pods pedo 
Xpodo 2. pod; "Xp. 2.p4 do 
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451 476 426 396 
EON E 


4306 426 476 451 
= ] 
Poi x Pio = 1 


Factor Reversal test is satisfied when 


P. = 2pigo +Pidi J#41P0  242Pi 
oa 1 a. Xop a, 910 = I5a.0. Žž Fao. 
Podo 2-Podi JoPo ~doPi 
2.pa4 do " 2Zpaidgi X diPo zdp 
Zpgdo Xpogi XgoPo doPi 


Po91xQo,o4 = 


451 476 2426 3176 


aa 
OB 
e 





x x x 
396 426 396 451 


CAT €) = 
£3296)" 











Hence factor reversal test is satisfied. 
Chain Base Index 

The base may be fixed on changing. So far we have used 
fixed base method in various formulae. In the fixed base method, 
the base remains constant throughout, ie., the relative for all the 
year is based on the prices of that single year. On the other hand, 
in the chain base method, the relative for each year is found out 


from the prices of the immediately preceding year. 
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The indices, which we found out by this method, are 
called link relative index numbers or link relatives. The links are 
liked together for example, if we are to compute the index 
number for 2008, the base year is 2008; and index number for 
2010, the base year is 2009, for 2011, the base year is 2010 and 
so on. The following formula is used for finding out the chain 


index. 


Current year link relative x Previous year chain index 


Chain index — 
100 
Illustration 8 
Construct (a) fixed base and (b) chain base index numbers 


from the following data relating to production of electricity. 


Year : 2004 2005 2006 32007 2008 2009 2010 2011 2012 
Production : 25 27 30 24 28 29 31 35 40 
(1000 kwt) 

Solution 


(a) Calculation of fixed base index number taking 2004 as 
base 


1000 kwt 2004) as Base 
: 
: 
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In above, 


Current vear production 
Fixed base index Number = ini dci. alo 


Base year production 


b) Calculation of Chain Base Index Number 


diais bae A ON IN MO MUT EP 
u 100 


Link Relative Chain indices 
me = year x100 
are vinos year 
100 






Production 
(1000 kwt) 


2004 2 


& 
R 
A 
bl 
| 


27 408x100 
30 111.11x108 


2007 2 


IN 


eis 
24 80x120 
NT —— —— = 96 









Loc 
116.67 x 96 E 








29 103 .5x112 

2009 29 — x100 = 103.57 — 116 
31 106 .90x116 
35 112 .90x124 

2011 35 ~ x100 = 112.90 | ——————— = 140 





2012 


] abil 
AO 114 .29x140 
40 — x100 = 114.29 = 160 


Conversion of chain index to fixed index 
At times it may be desired to covert the chain base index 
numbers into fixed base index numbers in such a case the 


following procedure is followed. 
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a. For the first year the fixed base index will be taken the 
same as the chain base index. However, if the index 
numbers are to be constructed by taking fixed year as the 
base in that case the index for the first year is taken as 100. 

b. For calculating the indices for other years the following 


formula is used. 

Current year'sCBI x Previous year’sFBI 
: 160 

F.B.I. = Fixed Base Index Number 


C.B.I. — Chain Base Index Number 


Current year'sFBl = 


Illustration 9 
From the chain base index numbers given below, prepare 


fixed base index number. 


Year : 2008 2009 201O 2011 2012 
C.B.I : SO 110 120 90 140 
Solution 


Computation of Fixing Base Index Numbers 


Current year CBI x Previous year FBI 


F. B.I = 
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Consumer price index numbers 

Consumer price index is also called as the cost of living 
Index. Statisticians recommend that the term ‘cost of living 
index’ or ‘Price of living index’ or ‘Cost of living Price index’ or 
consumer price index can be used in appropriate place. In 
different countries, cost of living index consumer price index and 
retail price 1ndex are used. 
Uses of consumer price index 

1. This is very useful in wage negotiations and wage 
contracts and allowance adjustment in many countries. 

2. Government can make use of these indices for wage 
policy, price policy, taxation general economic policies 
and rent control. 

3. Changes in the purchasing power of money and real 
income can be measured. 

4. We can analyze the market price of particular kinds of 
goods and services by this index. 

Construction of a consumer price index(CP1) 
There are some precautions to be taken in the consumer of 
price index. 

1. Determination of the class of people 

2. Selection of based period conducting family budget 
enquiry. 

Methods of constructing price index 
There are two methods of constructing price index. They 
are 

1. Aggregative expenditure method 

2. Family Budget method 
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1. Aggregative Expenditure method 
This method is based upon the Laspeyre’s method. It is 


widely used. The quantities of commodities consumed by a 
particular group in the base year are the weight. 


2;:P1do z 
> Padna 


Consumer Price Index Number = 100 


2. Family Budget Method 
Here an aggregate expenditure of an average family on 


various item is estimated and its value weighted. Formula is 


a) Consumer price index number = = 
Where P = = X100 ; P, — Current year price 
V —pogo PO — Base year price 
b) Consumer priceindex number — = 


Where P = =~? x100 
o 


Illustration 10 
Compute consumer price index number from following. 










Base year 
price 









Group 
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Solution 
Construction of consumer price index 


Curren 
t Year 
Price 







Consumer Price Index Number 
TP 
== > BY 
13862.5 
^ . 100 





= 138.625 


Illustration 11 
Construct a consumer price index number from the table given 


below. 


Misc 
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Solution 
Calculation o Cost of Living Index 


Index for Expenditure 
Group 2010 (70) 
Y W 


emma | 25 | w | aw — 
ooo [1 wo | ws 


Cost of Living Index Number = 











Illustration 12 
Compute the cost of living index number using both 
aggregate Expenditure method and family Budget method from 


the following information. 
vu Price in Price in 
Commodity Consumption e 
: base year Current year 
in base year 





Solution 


Calculation of cost of living index by Aggregative 
ee method 














Ciothing 


Aggregative Expenditure method cost of living index number 


— Eao 2100 








_ 3085 
Zulu] 





X100 = 135.9 


Cost of — index by family Budget method 


nau FIERE CET Mrd 
LU 
O O 
roo [son 
E 
2.9 
10 





maa | so faco Cw» [senes 
fone fo PE pE aso T oo | SOTO 
Sugar 4o | 2.50, E ari 
Sugar 

on e[tepue| e |g 
mec fofo 


PV-— 


3,08,50 








EPV 
Family Budget Method Cost of Living Index Number = aep 
|. 8,08,500 
| 2270 
= 135.9 


Production index: Index of industrial production 

lhe index of the industrial production is aimed at reflecting 
changes (increase or decrease) in the volume of industrial 
production (1.e., production of non agricultural commodities) in a 
given period compared to some base period. These indices 
measure, at regular intervals, the general movement in the 
quantum of industrial production. Such indices are useful for 


studying: 
1) The progress of general industrialization of a country, and 


1i) The effect of tariff on the development of particular 
industries. 


These indices of industrial activity are of great importance n 
the formulation and implementation of industrial plans. For the 
construction of indices of industrial production, the data about 
production of various industries are usually collected under 


following heads: 
1) Textile industries: cotton, silk, woolen, etc. 


11) Metallurgical Industries: Iron and steel, etc. 

111) Mining Industries: coal, pig-iron and ferro-alloys, petrol, 
kerosene, copper(virgin metal) etc. 

iv) Mechanical Industries: Locomotives, sewing machines, 
aerolanes, etc. 

v) Industries subject to excise duty: Tea, sugar, cigarettes and 
tobacco, distilleries and breweries, etc. 

vi) Electricity, gas and steam: Electric lamps, electric fans, 
electrical apparatus and appliances, etc. 

vii) Miscellaneous: glass, paints and varnish, paper and 
paperboard, cement, chemicals etc. 
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Usually, the data (figures of output) are obtained for 
various industries on monthly basis and the indices of 
industrial production are obtained as the weighted 
arithmetic mean (or sometimes geometric mean ) of the 
production (quantity) relatives by the formula: 


I Q,W, 
of 
W, 
Where O,- production relative— Era A 
doj 


And W, is the weight assigned to jth term (industry) 


The weights may be assigned to various industries on 
the basis of, say, capital invested, net output, 
production etc. The concept of ‘value added by 
manufacture’ is the most commonly used criterion for 
determining the weights to be assigned to different 
industries. 


Interim Index of Industrial Production (New Series): This official 
index number of industrial production using 1946 as base year 
was started (by the office of the Economic Adviser, Ministry of 
Commerce and industry, Government of India) from January 
1949 and consisted of 20 items covering mining and 
manufacturing. The “value added by manufacture”, as obtained 
from the First Census of Manufacturers 1946was used for 
determining, the weights to be assigned to different items. The 
index of monthly production was computed by the formula 





W.O. 
Ty ——Hf£ 100. 
: W, 
Where Q,= quantity relative for the jth industry for the month in 
question= dy 
d oj 


W ,= weight allotted to jth industry 


The adjusted index number, after allowing for variation in the 
number of days in the month for all the industri es except sugar, 
was obtained by the formula 


QixAverage noO Bays of m Tn. cité: Ina year 
h.? ire... a MM M iL 


2; Cealemder no of days ibn the month 


Where Q, and QO, are adjusted and un-adjusted quantity relatives 


respectively. 
For sugar, the seasonal variation adjusted indices were obtained 
by the formula 
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w, 2% 3.54 
ay, 
Where w, is the adjusted weight; q,, is the production in the 
corresponding month of the base year , a,, is the average monthly 


production in the base year and 3.54 1s the weight given to sugar 
industry in the overall weighting pattern. 
Example 
Calculate Index numbers of foodgrains production for the years 
1980-81 and 
1981-82 with base 1979-80=100 from the following data: 
Foodgrains | Weight 
42 53 54 


12 29 32 35 












Rice 
Wheat 
Jowar 
Bajra 
Other 
cereals 
Pulses 















Solution. The indices food of grains production for the years 

1980-81 and 1981-1982 with base year 1979-80, using fixed 

weights are obtained by Kelly’s fixed weight formula. 
CALCULATIONS FOR FOODGRAINS 


PRODUCTION INDICES 
Food Weight a. aa Weight Production 
Grains (W) a. aa tones) 


1979-80 | 1980-81 | 1981-82 
(042 E 42 
1. 1. Rice | B. AN WEE ES. 1428 | 1802 1836 


5. Other 6 1] 12 13 72 78 
cereals 

10 11 12 100 110 120 
"uw 








quW 244 

Q* ————. 100 3 100 —121.2 
Tow 2015 
q,W 2532 

O. =_ = 1006 100 125.6 
ü dW 2015 


Example 


Index of Industrial Production covers three groups of industries. 
This index increased from 106.4 to 150.2 from one point of time 
to another. The index numbers of individual three groups of 
industries over the same period changed as follows: miming and 
Quarrying from 102.0 to 144;Manufacturing from 106.5 to 146.6; 
Electricity from 110.4 to 189.9.Determine the weights for the 
individual groups of industries. 
Solution: Let the weights of three industries Mining and 
Quarrying, Manufacturing and Electricity be x,y and z 
respectively such that 

x+y+z=100 z—100-x-y 
CACULATIONS FOR INDEX NUMBERS OF INDUSTRIAL 
PRODUCTION 


(100-x- 
y) y) 





Initial index is given by : 
TW  102x 16.5» 11.4(00 x y) 
Ww 100 
8.2x 3.9y 11040 100 106.4 10640 


106.4 (Given) 


8.2x+3 .9y-400=0 is) 
Final index is given by 


I.W 144.1x 146.6y 189.9000 x y) 


150.2 (Given) 
W 100 
45.8x 43.3y 18990 150.2 100 15020 
45.8x+43.3y-3970=O ...... (2) 
Multiplying (1) by 45.8 and (2) by 8.2, we get respectively 
375.56x+178.62y-18320=0 ..-(3) 
375.56x4355.06y-32254—0 (T 
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(4)-(3) gives 


176.44y-14234=0 


Ame. Bod? «iti 
176.44 


Substituting in (1) we get 


8.2x+3.9 $80.67-400—0 
8.2x—400-314.61—85.39 


m 10.41 : ...(6) 
8.2 





Substituting in (8) , we have 


Z=100-x-y=100-10.41-80.67=8 .92 

Hence the weights of three industries are: 
Mining and Quarrying: 10.41 10 
Manufacturing : 80.67 81 
Electricity : 8.92 9 


Limitations of Index Numbers 


Even though index numbers are very important in business 


and economic activities, they have their own limitations, they are 


L. 


There may be error in each stage of the construction of the 
index number, namely selection of commodities, selection 
OÍ base period, selection of weight etc., 

Index numbers may not present the exact change in price 
level, because they are based on sampled data. 

lastes, habits and customs of people change in course of 
time and may make the weighting not suitable for the 
present data 

In each index there is an index, error, because there is no 
formula for measuring the price change. So there is the 
formula error. Hence it will not be a representative one. 

By selecting a suitable year as the base year, selfish 


persons may get their desired results. 
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TIME SERIES 

One o the most important tasks before economists and 
businessman these days is to make estimates for the future. Or 
example a business man is interested in finding out his likely 
sales in the year 2008 or as a long-term planning in 2020 or the 
year 2030 so that he could adjust his production accordingly and 
avoid the possibility of either unsold stocks or inadequate 
production to meet the demand similarly, an economist in 
interested in estimating the likely population in the coming year 
so that proper planning can be carried out with regard to good 
supply jobs for the people etc., However, the first step in making 
estimates for the future consists of gathering information from the 
past. In this connection one usually deals with statistical data 
which are collected, observed or recorded at successive intervals 
of time. Such data are generally referred to as time series. 
Definitional of the term time series 

A time series is a set of observations arranged in 
chronological order'. 
Uses of time series 

Time series analysis is useful in different fields like 
economics, science, research work, etc., because of the following 
reasons. 

1. It helps in understanding the past behavior. 

2. It helps in planning and forecasting 

3. Comparison between data of one period with another 

period is possible. 
4. It is useful not only to economists but also to the 
businessman. 


5. It helps in evaluating current accomplishments. 
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Types of variations or components of time series 
The components of a time series are 

Secular trend 

Seasonal variation 


Cyclical variation and 


- m 


Irregular variations 

There are four basic types of variation and these are called 
the components or elements of time series. 

Components 


MA 


Long-Term 


Y Short-Term l poe 


Secular Cyclical Seasonal 

Irregular 

(Erratic) aa le 
Regular 


1. SECULAR TREND 
The general tendency of the time series data to increase or 


decrease or stagnate during a long period of time is called secular 
trend, also known as long term trend. The steady increase in the 
cost of living recorded by the consumer price index is an example 
of secular trend. From year, individual year, the cost of living 
varies a great deal, but we examine a long term period, we see 
that trend is toward a steady increase. It is an increasing but 


fluctuating time series. 
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Uses of Trend 


1. The trend describes the basic growth tendency ignoring 


short term fluctuations. 
2. It describes the pattern of behavior, which characterized 
the series 1n the past. 
3. Future behavior can be forecasted in the assumption that 
past behavior will continue in the future also. 
4. Trend analysis facilitate us to compare two or more time 
series over different period of time and this helps to draw 
conclusions about them. 
2. SEASONAL VARIATIONS 

These are small time fluctuations due to the peculiar 
conditions caused by seasonal effects. For example the price of 
rice will go down during the harvest season and again there will 
be rise after this season. This change occurs periodically. These 
kinds of variations changing with the seasons are called seasonal 
variation. 
3. CYCLICAL VARIATIONS 

This occurs primarily in economic time series and is 
similar to the seasonal variations. However it occurs periodically 
at intervals such longer than that found in seasonal variations. For 
example due to booms and depressions there will be steady 
upward and downward movements in economic time series. 
Irregular variation 

Due to war, earth quakes, floods and random factors, time 
series will change in an irregular fashion. Such changes are called 
irregular variation. 
Methods of estimating trends 

There are four methods of estimating secular trend 


1. Free hand or Graphic method 
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2. Semi-average method 

3. Moving average method 

4. Method of least squares 
1. Free hand curve (or) Graphic fitting method 

This is the easiest, simplest and the most flexible method 
of estimating secular trend. In this method we must plot the 
original data on the graphic. Draw a smooth curve carefully 
which will show the direction of the trend. In this the time is 
shown on the horizontal axis and the value of the variable is 
shown on the vertical axis. 


Illustration 
Fit a trend line to the following data by the tree-hand 


method. ^ 

Year 2006 2007 2008 2009 2010 2011 
2012 

Sales > w GS 95 85 115 110 120 
130 


(1000 units) 


TEE DEEP REM Ai Aa a scp a CC EN 
E 
4 
l - 
I 

i - 


130 M 


2006 
2007 
2008 
2009 
2010 
2011 
2012 


LL Tab he ere II HERI tt H ranr men nme H ^ " 
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Merits : 

1. It is the simples, earliest and quickest method. It saves 
times and labour. 

2. It is adaptable and flexible and it can be used to describe 
all types of trend (ie) linear and non linear. 

3. Experienced statistician can draw a free hand line more 
accurately than a mathematician this can widely be used in 
applied situations. 

4. It will help to understand the character of time series and 
we can use appropriate mathematical trend. 

Demerits 

1. It is highly subjective. It is subject to personal bias. The 
results depends upon the judgment of the person who 
draws the line. There may be different curves for different 
persons. 

2. It seems to be very simple. But it requires more time for a 
careful job. 

3. If it is not drawn by experienced person, the it is 
dangerous to use for forecasting purpose. 


4. It does not help us to measure trend. 


2. SEMI AVERAGE METHOD 

In this method the original data are divided into equal 
parts and averages are calculated for both the parts. These 
averages are called semi averages. 


Merits 


1. It is simple and easier to understand than the moving 


average and least square method. 
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illustration 
Draw a trend line by the method of semi averages. 


Year 2006 2007 2008 2009 2010 2011 


2012 
Sales (1000) 
130 


110 105 115 112 120 118 


Solution 
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2006 
2007 
2008 
2009 
2010 
2011 
2012 
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3. Moving Average Method 

In this method, the average value for a number of years or 
months or weeks is taken into account and making it at the centre 
of the tim-span (period of moving average) and it is the normal or 
tend value for the middle period. 
Calculation of moving averages 


The formula for calculating 3 yearly morning average is 


agQ+hb+¢ Ec e+d-t+e 
3 i: 3 " ag 











The formula for calculating 4 yearly moving 


A+t+h+c+a b+c+d+e e+d+e+fF 
4. j 4 : 4 


2. As it does not depend upon personal judgment everyone 
who applies this method will get the same trend line unlike 
the former method. 

3. As the line can be extended both ways, we can get the 
intermediate values and predict the future values. 

Demerits 

1. Under this method, it has an assumption of linear trend 
whether such a relationship exists or not. 

It 1s affected by the limitation of arithmetic mean 
3. The method is not enough for forecasting the future trend 
or for removing trend from original data. 
Illustration 

Draw a trend line by the method of semi averages. 
Year 2007 2008 2009 2010 2011 2012 
Sales 1000  : 60 75 81 110 106 120 


S20 295 








ase M 
110 di 
» 
^^ 
P d 

| ^ 
"ed P d 
wa hr 

90 p 

^ 
Ps 
80 = 
F 
^ 
S 
70 
60 
= = e = = Sa 
ei oa e cea ei cq 
Year 


Merits 
1. It is sample and easy to understand it is easier to adopt 


when compared to the method of least square. 

2. It is more flexible that other methods. K is elastic in the 
sense items can be increased or decreased without 
affecting the moving average, the only snag 1s that we get 
trend values or less trend value. 

3. It is not only used for the measurement of trend but also 
for the measurement of seasonal, cyclical and irregular 
fluctuations. 


Demerits 
1. The main object of trend value is that it is used for 


forecasting or predicting future values, because this 


method is not represented by a mathematical function. 
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2. There is no rule regarding the choice of the number of the 
moving average and so the statistician has to use his own 
judgment. 

Illustration 
Calculate the 3 yearly moving average of the production 


figures given below. 


Year 2000 2001 2002 2003 2004 2005 2006 
2007 2008 | 

Production 

lonnes 15 Zl 30 30 42 46 50 56 
63 

Solution 


3 yearly 
moving 
average 


3 yearly total 
(m tones) 





Illustration 
Assuming a four yearly cycle calculate the trend by the 
method of moving average from the following data relating to the 


production of tea in India. 
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Year 2003 2004 2005 2006 2007 2008 2009 2010 


2011 2012 
Production 464 515 518 467 502 540 557 57l 
586 612 


Gan millions) 
Solution 


Trend method of moving average 


Year | Production totai moving average 
total centered 

| 5725 

DEGEN 

D 


2010 
2011 586 
2012 612 


T 
: 
j= 
: 
U 
N 
O 
IN 
Un 
60 
: 
T 
~x 
N 
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Method of Least Squares 

It is a mathematical as well as an analytical method. Under 
this method a straight line trend can be fitted to the given time 
series of data. This is a most important and accurate method of 
measures the long-term trend. In this method, we can fit either a 
straight line or a curve, which is the lies of best fit for the given 
data. 

By making use of the equation of this line we get the trend 


values. For all the given year and also we can predict or forecast 
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the future trend values. The equation of the line of best fit can be 
taken as y=atbx 

Where y denotes for the variable, x denotes the time 
period (which is deviation from a particular year) and a,b are 
constants to be foundout using the following tow normal 


equations 


y=na+b x 
and xy=a x+b x^ 


If we take x as the deviation from the middle year then 


x—O. In this case the normal eqn. becomes 





y—na and xy —b x^ 
= Ix 
TE 2w 


Substituting the value a and b we get the equation of the straight 


— line trend. 
illustration 


Fit a straight line trend for the following data 


Year 2005 2006 2007 2008 2009 2010 2011 
Production 
in tonnes 62 70 75 814 89 93 104 


Also find the tend values for the given years and estimates the 
trend value for the year 2014. 
Solution 

Let the equation of the straight line by y—at+bx. Where y 
denotes the production in tonnes. Here x is the deviation of the 
middle year. 


The normal equations are 


y=natb x 
xy—a xtb x^ 
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If ,=0, then 
a ELE az 
b = —* — 12 _ 6.643 
337 28 


The eqn. of the straight line trend is y=82+6.643x 
To find the trend values 
For the year 2005, y=82+6.643(-3) = 62.071 
For the year 2006, y=82+6.643(-2) = 68.714 
For the year 2007, y=82+6.643(-1) = 75.357 
For the year 2008, y=82+6.643(0) = 82.000 
For the year 2009, y=82+6.643(1) = 88.643 
For the year 2010, y=82+6.643(2) = 95.286 
For the year 20011, y=82+6.643(3) = 101.929 
To find the trend value for the year 2014, 

For 2012, x=4 

For 2013, x=5 

For 2014, x=6 

Then y = 82+6.643(6) 
= 82+39.858 
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= 121.858 
Estimated production for the year 2014 = 121.858 tonnes 
Illustration 
Using the method of least squares obtain the trend value 


for the following data. 


Year 2005 2006 2007 2008 2009 2010 2011 
2012 

Earning 38 40 65 72 69 60 87 95 

(Rs. Lakhs) 


Assuming the same rate of change continuous, what would 
be the predicted earning for the year 2014? 
Solution 
Let the equation of the straight line be 
y—adtbx 
The normal equations are 


y —natb x 


xy--a xtb x 
FITTING OF STRAIGHT LINE TREND BY THE 
METHOD OF LEAST SQUARES 


Earning 
CRs. Lakhs) 





If x=0 then 


n R 
* 308 
b= Hy = —— = 7.333 
xx 42 


The equation of the straight line trend 1s 
yoatbx 
y=—65 .754+7.333 x 
For 2014, x will be 5.5 
When x=5.5, y=65.754+7.333 x 5.5 
= 65.75+40.336 
= 106.086 
Thus the estimated earning for the year 2014 are Ks. 
106.086 Lakhs 
Parabolic curve | 
Parabolas are non-linear, they from into smooth curves. 
The shape of these curves depends upon the value of the 
constants a,b,c etc. The general form of the equation of the power 
series y=a+bx+cx*+dx°+.... The equation of this type does not 
represent a curve of strictly parable type but in common usage, 
the term parabolic curve is used to indicate curves obtained by 
equations of the type. The trend equation in this case is 
y. = a + bx + cx? 
Where a is the trend value at the time origin 
b 1s the slope at the origin and 
. € establishes whether the curve is up or down and 
how much. 
The value of a,b and c can be determined by solving three 
normal equations simultaneously. 


y=natb x+c x? 
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xy =a x+b x*+c x? 


x^y—a x*+b x*+c xí 

In solving the above second degree equations much time 
and labour can be save by taking the time origin in the middle of 
the series so that x—0. But if x=0 then the sum of any add 


powers of x such as x? is also zero. 


Therefore, the above three equations are reduced to 


y—na-cc x^ (1) 
xy —b x2 (2) 
x^y—a x^-c x^ (3) 


Solving equations (1) and (3) we obtain the value of a and 


c and the value of b is obtained by solving equation (2) 


——€— *y—cÓix? 
TE 
R3 ut. 
b v za 
" Ex?-y—azxx? 
ma 


Illustration 
Fit a parabola of the second degree to the data given 


below. 
Year 1996 1997 1998 1999 2000 
Sales 
O 24 
(1000) 16 18 19 2 
Solution 








Let the equation of the second degree parabola be y=a+bx-+cx 


The normal equations are 
2 


y=natb x+c x 
xy —a x+b x*+c x? 
x^y—a x?^-b x?^-c x^ 


If x=0 then 
Ey-—crx? 
i BI" EE 
97—ch10)3 
SM 
D';—10c 
5 


Sa = 97-10c 


Sat+10c = 97 (1) 
= y — Ls — .G 

Xa 10 

Ex? y— arx? 198—ao/(10) 
pe xpo qr oc ee ee 

Ex aA 

34c — 198-10a 
10a+34c=198 (2) 


Solving (1) and (2) 
(1)x(2) 10a+20c= 194 


LtOacG--2ic—iS9t£ 


(2) 14r—4 
Subtracting, 
c=—=0.29 
14 
C=0.29 


Substituting the value of C in equation (1) 
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5ac-10(0.29) = 97 
5at2.9 = 97 
Sa=97-2.9 
5a—94.1 


a= == 18.82 


a—1$8.82; b—1.8; c=0.29 

y—18.82--1.8x--0.29x^ is the required trend equation 
required trend equation. 

Trend values are 

y1996 = 18.82+1.8(-2)+0.29(-2)°=16.38 

y1997 = 18.82+1.8(-1)+0.29(-1)°=17.31 

y1998 = 18.824-1.8(0)4-0.29(0)? —18.82 

y1999 = 18.82+1.8(1)+0.29(1)°=20.91 

y2000 = 18.82+1.8(2)+0.29(72yY=23.58 
Mieasurement of seasonal variations 

In a time series seasonal variations come into force in 
regular periods. There is monthly or quarterly seasonal variations 
in the economic and business phenomena. 

The objective of studying seasonal variation is to 
determine the effect of seasonal variation on the value of given 
phenomenal and to eliminate them (1e) determining the size of the 
value of variables. 

The study of seasonal variation is important in deciding 
the business policy of various firms. The time series data are 
recorded monthly, quarterly, weekly, daily or hourly. There will 
be difference in them due to seasonal variations. 

There are four methods. They are 

1. Methods of simple average 


2. Ratio to trend method 
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3. Ratio to moving average method 
4. Link relative method 
Method of Simple Average 
This is the simplest and easiest method of calculating a 
seasonal index. The steps are 

1. Average the data for each month or quarters for all the 

years. 
Find the total of each month or quarter. 

3. Divided each total by the number of years for which data 
are given. If we are given monthly data for 4 years, we 
must first get the total for each month for 4 years and 
divide each total 4 to get an average. 

4. We can get an average of monthly averages by dividing 
the total of monthly average by 12. 

5. We must taken the average of month averages as 100 and 


get the seasonal index as follows. 


Seasonal index for January = 
for] y Average of monthly average 


Illustration 


Compute the average seasonal movement seasonal index 
for the following data. 
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Monthly average of January 
T a A I x 


100 


Solution : Computation of Seasonal Indices 


Seasonal 
Index 





Quarteriy average 
Seasonal index = — OO ~® 22100 
General average 








.675 
Seasonal index for the first quarter — a x100 
= 98.66 
25 
Seasonal index for the second quarter — "emm x100 
= 110.74 
- .55 
Seasonal index for the third quarter = Uv x10 


= 95.30 
SUMMARY 
From this unit we are able to understand what is index 
number, how it is constructed, how it is tested for perfection, 
and how to construct consumer price index. 
KEYWORDS 
1. Weighted index numbers 


2. Aggregate index numbers 
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3. Value index numbers 
4. Family based budget 


5. Consumer price index 


FURTHER READING 

1. R.S.N. Pillai, Business Statistics 
2. S.P. Gupta, Statistical Methods 
3. Elhance, Statistics. 


KEY TO CYP QUESTIONS 
1. From the following data, compute price index by Simple 
Average of Price Relatives method based on (a) 


Arithmetic Mean, and (B) Geometric Mean. 


Butter 
Cheese 
Milk 
Bread 


Eggs 
Ghee 





Answer: 
(a) Arithmetic mean=104.67 
(b) Geometric Mean = 104.57 


2. Construct Index Numbers of price from the following data 
by applying: 
1. Laspeyre’s method 
2. Paasche’s method 
3. Dorbish and Bowley’s method 
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4. Fisher's Ideal nethod, and 
5. Marshall- Edgeworth method 


2004 2005 





1. Laspeyre's method = 125 

2. Paasche’s method = 126.21 

3. Dorbish and Bowley’s method = 125.61 
4. Fisher's Ideal nethod = 125.60, and 

5. Marshall- Edgeworth method = 125.48 


EXERCISE 


1. From the following data construct on index number for 
1996 taking 1995 as base 


Commodity and Unit | Price in 1995 | Price 1n 1996 


Butter(kg) 
Cheese 
Mülk(l1t) 


Bread(1) 
Eggs(doz) 
Ghee(l tin) 





2. Construct index number of price from the following data 
by applying. 


l. Laspeyre's method 
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2. Paasche’s method 

3. Dorbish and Bowley’s method 
4. Fisher’s Ideal nethod, and 

5. Marshall- Edgeworth method 
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UNIT 5: TESTING OF HYPOSTHESIS 

Introduction 

A hypothesis is an assumption to the tested. The statistical 
testing of hypothesis is the most important technique in statistical 
inference. Hypothesis tests are widely used in business and 
industry of making decisions. It is here that probability and 
sampling theory plays an every increasing role in constructing the 
criteria on which business decisions are made. Very often in 
practce we are called upon to make decisions about population 
on the basis of sample information. For example, we may wish to 
decide on the basis of sample data whether a new medicine is 
really effective in curing a disease, whether one training 
procedure is better than another etc., such decisions are called 
statistical decisions. 
Unit objectives 

This unit addresses to the following objectives. 

To understand the hypothesis testing procedure. 


To know the concepts of parametric tests. 


Unit Structure 
Introduction 
Unit Objectives ] 
Unit structure 
Hypothesis testing 
Test of significance of large samples 
t-test 
f-test 
Chi-square test 


Exercise 
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Hypothesis Testing 

In an statistical investigation the interest usually lies in the 
assessment of the general magnitude and the study of variation 
with respect to one or more characteristics relating to individual 
belonging to a group the group of individual under study is called 
population or universe. Population may be finite or infinite. 

A finite subset of statistical individuals in a population 1s 
called sample. 

The number of individuals in a sample is called the sample 
size. 
Parameters and Statistic 

In order to avoid verbal confusion with the statistical 
constants o the population, namely mean p, variance ? which are 
usually referred to as parameters. Statistical measures computed 
from sample observations alone. E.g mean (x), variance (.s? ` etc 
are usually referred to as statistic. 
Null Hypothesis 

For applying the test of significance, we first set up of a 
hypothesis — a definite statement about the population parameter. 
Such a hypothesis is usually a hypothesis of no-efficient and it 1s 
denoted by Ho. 

In case of a single statistic, Ho will be that the sample 
statistic does not differ significantly from the hypothetical 
parameter value and into the case of two statistics (Ho) will be 


that the sample statistic do not differ significantly. 
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Alternative Hypothesis (H1) 


Any hypothesis which is complementary to the null 
hypothesis is called an alternative hypothesis, usually denoted by 
H1. 

For example, if we want to test the null hypothesis that the 
population has a specified mean po (say) G.e) Ho:p-pno, then 
alternative hypothesis would be 

Q) H, : uxo 

GD Hii: pepo 

Qu) Hi: uho 

The alternative hypothesis (1i) is known as a two tailed 
alternative (ii) is know as a right tailed alternative and (11) is 
known as left tailed. 

The setting of alternative hypothesis is very important to 
decide whether we have to use a single-tailed (right or left) or 
two tailed test. 

Errors in Sampling 

The main objective in sampling theory is to draw valid 
inferences about the population parameter on the basis of the 
sample results. In practice we decide to accept or to reject the lot 
after examing a sample from it. As such we have two types of 
error. 

(1) Type I error : Reject Ho when it is true 

(1) Type II error : Accept Ho when it is wrong 
Critical Region 

A region corresponding to a statistic in the ample space S 
which lead to the rejection of Ho is called critical region. Those 
region which lead to the acceptance of Ho give us a region called 


acceptance region. 
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Level of Significance 
The probability * °’ that a random value of the statistic t" 
belongs to the critical region is known as the level of 
significance. In other words, level of significance is the size of 
the Type I error. The levels of significance usually employed 1n 
testing of hypothesis are 5% and 1%. 
Procedure for testing of Hypothesis 
We now summarize below the various steps in testing of a 
statistical hypothesis in a systematic manner. 
G) Set up the null hypothesis 
(i) Choose the appropriate level of significance (either 5% or 
1% level) this is to be decided before sample is drawn. 


(111) Compute the test statistic 


_ t—E(t) . 
— SEC) under the null hypothesis 





(iv) We compare the computed valve of z in step (111) with 
the significance value at given level of significance. 
If |Z|<1.96 Ho may be accepted at 5% level of 
nT, ee 
If |Z\>1.96, Ho may be rejected at 5% level of 
significance. 
If |Z|-2.58, Ho may be accepted at 1% level of 
significance 
If |Z|>2.58 Ho may be rejected at 1% level of 


significance 


For a single tailed (Right or Left Tail) we compare the 
computed value of |Z| with 1.645 (at 5% level) and 2.33 at (1% 


level) and accept or reject Ho accordingly. 
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Large Samples 
Test the significance of Large samples. 
If the size o the sample n—30, then that sample is called 
large sample. 
There are four important test to test the Significance of 
large sample. 
1. Test of significance for single proportion 
2. Test of significance for difference of proportion 
3. Test of significance for single mean 
4. Test of significance for difference of mean 
Test of Significance for Single Proportion 
Suppose a large sample of size n is taken from a normal 
population. 
To test the significant difference between the sample 
proportion p and the population proportion P, we use the statistic 


p-—P 


A= iz Where n is the sample size. 
Eu 


Notes: Limits for population proportion P are given by P 4 3 -s 


where q-—1-p. 
IJHustration 1 

In a sample of 1000 people in Karnataka 540 are rice 
eaters and the rest are wheat eaters. Can we assume that both rice 
and wheat eaters are equally popular in this state at 1% level of 
significance? 
Solution 

Given n = 1000 


p = sample proportion of rice eaters 


540 
|^ 1000 


— 054 
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P = population proportion of rice eaters 


Q=1-P=1-0.5=0.5 
Null hypothesis 
Both rice and wheat eaters are equally popular in the state 
Ge) Ho : P=OS5 
Alternative hypothesis H, : PzO.5 (two tailed test) 


" : É32—H7 0,2841—40.5 
Test Statistic Z = E E eee 











IPO 19.5 40.5 
Ne 72 ^; 1000 
= 2.532 


Calculated value of Z=2.532 
The tabulated value of Z at 1% level of significance for two tailed 
test 1s 2.58. 
Calculated value of Z<Tabulated value of Z 
Accept the Null Hypothesis Ho 
Both rice and wheat eaters are equally popular in the 
state. 
Illustration 2 
In a big city 325 men out of 600 men were found to be 
smokers. Does this information the support the conclusion that 
the majority of men in this city are smokers? 
Solution : Given n—600 


p = simple portion of smokers 


|| 325 
600 


— 0.5417 





P — Population proportion of smokers in the city 
E: 
Z 


= 0.5 
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Null Hypothesis Ho 

The number of smokers and non-smokers are equal in city 
Alternative Hypothesis 

H,:P>0.5 (Right-tailed) 
Test Statistic 

P—Q _ 0.5417-0.5 _ 

Calculated value of Z = 2.04 

labulated value of Z at 5% level of significance for right 
tailed test 1s 1.645. 

Since calculated value of Z > Tabulated value of Z 

We reject the null hypothesis. 

Ge) The majority of men in the city are smokers 
Illustration 3 | 

A random sample of 500 pineapples was taken from a 
large consignment and 65were found to be bad. Find the 
percentage of bad pineapples in the consignment. 
Solution 

Given n—500 


P — Proportion of bad pineapples in the sample 
65 


= — = 0.13 
5060 
q = 1-P = 0.1-0.13 
— 0.87 


We know that the limits for population proportion P are 


given by P + 3 E 


— 0.13 + Q.1 3 x (31.87 
DS n Sob 


— 0.13 + 0.45 
= (0.13 + 0.045, 0.13-0.45) 
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= (0.175, 0.085) 
The percentage of bad pineapples in the consignment 
lies between 17.5% and 8.5%. 
Test of Significance for difference of proportion 
Suppose two large samples of sizes mn and n> taken 
respectly from 2 different populations to test the significant 
difference between the sample proportion P, and P5 and 


Pi- Pa 


MEE de. 1 
JP SÍ n 


Mara tng Pa 


na + FE cx 


Where p = and q = 1 — p 


illustration 4 

Random sample of 400 men and 600 women were asked 
whether they would like to have a flyover near their residence. 
200 men and 325 women were in favour of the proposal. Test the 
hypothesis that proportions of men and women in favour of the 


proposal are same at 5% level. 


Solution 


Given sample size n,—400 





Proportion of me P, = = 0.5 


Proportion of women P, = — 
= 0.541 
Nul hypothesis Ho : Assume that there is no significant 
difference between the opinion of men and women as far as 
proposal of flyover 1s concerned 
Ho : P,—P5 
Alternative hypothesis Ho : P, x Ps (two tailed test) 
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The test statistic 


P4—P, 
E NUI UE Re 
JP I 


714 D4 3-71. p 
Where p = — 
na Han 


| | 400x0.5--600 x 0.541 
m 400--600 

525 

1000 


= 0.525 
q = 1-P 
= 1-0.525 = 0.472 
P, — Po 0.5 — 0.541 





pq u + 2 C0.525) (0.735) (45a 4 sa) 


—0.0-471 
D.012Z 


= -1.28 
4 = 1.28 
Calculated value of Z=1.28 


The tabulated value of Z at 5% level of significance is 
1.96 
Calculate value of Z < tabulated value of Z 
We accept null hypothesis 
(ie) there is no significant difference of opinion between men and 
women as for as proposal of flyover is emeerned. 
Illustration 5 
Before an increase in excise duty on tea, 800 persons out 
of a sample 1000 persons were found to be a tea drinkers. After 
an increase in duty, 800 people were tea drinkers in a sample of 
1200 people. Using standard error of proportion, state whether 
there is a significant decrease in the consumption of tea after the 
increase in excise duty. 
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Solution : By data, 











7*1, = 1000 714, = 1,200 

200 800 
a 1nnn z 120A 
= 0.8 = 0.667 
Now P — Ha Pat” Pa 

Tig tis 

m 1000 x 0.8 x 1200 x 0.667 

E 1000+1200 

— 1800 _ 9727 

22nn 
q = 1-p=1-0.727 = 0.273 
Nul hypothesis Ho : Assume that there is no significant 


difference in the consumption of tea before and after the increase 
in excise duty. 
Ho k P =P 2 


Alternative hypothesis H :P,-—P; (right tailed test) 
P4—P3i 


The test statistic 1s Z = p— 
ima zz) 


^ F2 
0,.8-—0.667 
= E (EET a N " "i 
M and Oo 273 (iz) 
0.98—0.6567 
0.0186 


— 6.972 
Calculated value of Z—6.972 


Tabulated value of Z at 5% level of significance (right 
tailed test) 1s 1.64. 

Calculated value of Z>Tabulated value of Z. 

We reject the null hypothesis. 
Ge) There 1s a difference in the consumption of tea before and 
after the increase in excise duty. 
Note : If we want to test the significance of the difference 
between P, and P then 
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Thy Pait Po 


= z e ` P,—P 

Then the statistic is Z = T——— 
dT i 

niu tni) 


Tilustration 6 


In a random sample of 400 students of the university 


teaching department, 1t was found that 300 students failed in the 


Examination. In another sample of 500 students of the affiliated 


colleges the number of failures 1s the same examination was 


found to be 300. Find out whether the proportion of failures in 


the university teaching department significantly greater than the 


proportion of failures 1n the university teaching departments and 


affiliated colleges taken together. 
Solution : By data, 


Given n, = 400 n> = 500 
sao 300 


P, = — = 0.75 Fa = — = 0.6 


406 5006 
71,224 T7045 P5 


p> 


300 soo; 
| 400255] 590 E 


4090+500 


= 0.667 
q =1-P 
= 1-0.667 = 0.333 
Null Hypothesis Ho : Assume that there is no 


difference between P; and P 


BEA P-P 
The test statistic 1s, 2 = odi. 


jpatzi - 


0.7 25—0.667 
i2o0zx0.667 x 0.283 
4 400 ¿C400 +500} 


0.083 
= —— = 44 
0.0175 


Calculate value of Z—4.74 
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significant 


Tabulated value of Z at 5% level of significance (Two 
tailed test) is 1.96. 
Calculated value o Z>Tabulated value of Z. 


Reject the null hypothesis 
The proportion of failures in the affiliated colleges 1s 


greater than the proportion of failures in university departments 


and affiliated colleges taken together. 


Test of significance for single mean 
Suppose we want to test whether the given sample of size 


n has been drawn from a population with mean p. We set up null 
hypothesis that there is no difference between X and ji where X is 


the sample mean. 


The test statistic is 





Where is the S.D of the population? 


If the population SD is not known, then use the statistic 
_ X—H 
AX = Ky 
NI 


Where S is the sample S.D 
Note : The values x + 1.96 — are called 95% financial limits or 





confidence limits for the mean of the population corresponding to 


the given sample. 


x + 2.5 a are called 99% confidence limits. 
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Illustration 7 

A sample of 900 members has a mean of 3.4 cms and S.D. 
2.61 cms. Is the sample from a large population of mean 3.25 cm 
and S.D. 2.61 cms. If the population is normal and its mean is 


unknown, find the 95% financial limits of the mean. 


Solution 
Given n—900 u=3.25 
x = 3.4 cm —2.61 
S=2.61 


Null Hypothesis 
Ho : Assume that the sample has been drawn from the 


population with mean p=3.25. 


Alternative Hypothesis 


H, : nz3.25 
The test statistic 1s 
E nt imd 
A7 
2.4— 2.25 
a z.63 
soo 


Calculated value of Z=1.724 
Tabulated value of Z at 5% level of significance (two- 
tailed test) 1s 1.96. 
Calculated value of Z<tabulated value of 
We accept the null hypothesis Ho 
(ie) The sample has been drawn from the population with mean 
u=3.25. 


95% confidence limits are 


2.61 
~ SOO 





x + 1.96 — — 3.4 + 1.96x 
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—3.4--0.1705 

= (3.44+0.1705, 3.4-0.1705) 

= 3.57 and 3.2295 
Illustration 

An insurance agent has claimed that the average age of 

policy holders who issue through him is less than the average for 
all agents which is 30.5 years. A random sample of 100 policy 
holders who had issued through him gave to following age 


distribution. 


Calculate the Arithmetic mean and standard deviation of this 
distribution and use these values to test his claim at 5% level of 


significance. 


Solution 


Computation of mean and standard deviation. 
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i Ex 
Mean x = A +H xe 


o 16x5 
= Z8 TTN — 28.8 





i . 2. wj 
Standard deviation 5 = J ore a (E) XC 
av M 


"IRSE ws 
— 6.35 
Null Hypothesis Ho : The sample is drawn from the population 
with mean pn(1.e) x and p do not differ significantly. 
Ge) Ho : n—30.5 years 
Alternative hypothesis H,:u<30.5 year (left-tailed test) 
Now x= 28.8, S=6.35 
u=30.5 n—100 
The test statistic is 
X—u 


Z =- 


— 


4 S 
23.8— 30.5 


= ——&aS — = — 2.677 
ao 
Z| = 2.677 
Calculated value of Z=2.677 
Tabulated value of Z at 5% level of significance is 1.645 
(left tailed test). 


Calculated value of Z>Tabulated value of Z.. 








The null hypothesis Ho is rejected. 
(1e) x and p differ significantly 
Ge) the sample is not drawn from population with mean u 
Test the significance for difference of mean. 


Let x, be the mean of a sample of size nı, from a 


population with mean p, and variance gÑ. 
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Let xz be the mean of a sample of size n, from a 


population with mean nu; and variance o5. 


To test whether there is any significant difference between 


xX, and xz we have to use the statistic 





Note : If the samples have been drawn from the same population 


then of = o$ = &* 


If is not known, we can usea /^ given to 


E- ngas enas” 
m Ftal T7715 
IHustration 


The mean of large samples 1000 and 2000 members are 
67.5 inches and 68.0 inches respectively. Can the samples the 


regarded as drawn from the same population of S.D 2.5 inches. 


Solution 
Given n, — 1000 n> = 2000 
x = 67.5 Inches x = 65 inches 
Population standard deviation =2.5 inches Null 


Hypothesis Ho : The samples have been drawn from the same 
population of S.D. 2.5 inches 
Ho : Wi=H2 
Alternative hypothesis H;,:n;,7J[o2 


To test statistic 1s 
_ Xi- fz 

" i= a 

YBa ng 

67.5—68 


jes" (z.5)7 
^ 1000 ` 2000 
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—0.5 
0.096€ 


= -5.16 
Zi = $5.16 
Calculated value of 7=5.16 


Calculated value of Z—Tabulated value of Z.. 
We reject null hypothesis to 


Ge) the samples are not drawn from the same population 


of S.D 2.5 inches. 
Illustration 


The mean yield of wheat from a district A was 210 pounds 


with standard deviation 10 pounds per acre from a sample of 100 


piots. In another district the mean yield was 220 pounds with 


S.D. 20 pounds from a sample of 150 plots. Assuming that the 


S.D of yield in the entire state was 11 pounds. Test whether there 


is any significant difference between the mean yields of crops in 


the two districts. 


Solution: 


Given Xi = 210,%5 = 200 
mn, = 100,n4 = 150 
Population S.D. -—11 


Null hypothesis Ho : There is no difference between x4, and 


Ge) Ho : X1— X; 


Alternative Hypothesis H1 : x, + x; 


The test statistic is 


Aa — Xp 


A = 
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= —— = 7.04178 


[is i121 
«4400 4156 


Calculated value of Z—7.04178 
Tabulated value of Z at 5% level of significance is 1.96. 
Calculated value of Z>Tabulated value of Z. 

We reject the null hypothesis Ho. 

(1e) There 1s a difference between x4 and x; 

(e) There is a significant difference between the mean 
yield of crops 1n the two district. 
Test of Significance of small samples. 
Definition 

Small sample 

When the size of the sample (11) is less than 30 than that 
sample is called a small sample. The following are some 


important tests for small samples. 


(1) Student's t-test 
(11) F-test 
(111) ? Test 


Student's t-test 
The student's t-test is defined by statistic 





X— 
r — = 
7 
_ XX 
Where x = ug u=population mean 
2  Zx-x 
S = FH n — sample size 
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Note : 


1. If the standard deviation of a sample is given directly, 


than the statistic is given by 
x — | 

S.D 

vn— 1 


T = 


Confidence or fiducial limits for u 
(1) If toos is the table value of t for (nı) degrees of 


freedom at 5 level of significance then 95% 


-— P = 5 
confidence limit for u is given by X + toos Um 
E 


(11) 99% confidence limit for u is 
= Ss : 
X + tuoi a where too; 1s the tabulated value of t for 


(n-1) degrees of freedom at 1% level of 
significance. 
Illustration 
The mean life of a sample of 20 fluorescent light bulbs 
produced y a company is computed to be 1570 hours with a S.D. 
of 120 hours. The company claims that the average life of the 
bulbs produced by the company is 1600 hours. Using to level of 
significance of 0.05, Is the claim acceptable? 
Solution 
Given, 
n=sample size—25 
x-—sample mean=1570 
p—population mean=1600 
S=standard deviation=120 
Null Hypothesis Ho 
The claim is acceptable 


Ge) Ho : u=1 1600 hours 
235 


Alternative hypothesis 
H;:pn»x: 1600 hours 


The test statistic is £ = -— 


«PI 1 


1570—1500 











Iti = 1.22 

Calculated value of t=1.22 

Degrees of freedom = n-1—25 

The tabulated value of t at 5% level of significance with 
24 degrees of freedom two tailed test is 2.06. 

Calculated value of t<Tabulated value oft 

We accept the null hypothesis Ho 

(ie) the claim that the average life of the bulbs produced 
by the company is 1600 hours is acceptable. 
IHustration 

The mean weekly sales of soap bars in departmental stores 
were 146.3 bars per store. After an advertising campaign the 
mean weekly sales in 22 stores for a typical week increased to 
153.7 and showed a S.D. of 17.2 was the advertising campaign 
successíul. 
Solution 

n=number of stores = 22<40 

The sample 1s small 
Also given, sample mean ¥ = 153.7 
Population mean p=146.3 
Sample S.D S=17.3 
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Since we known x, S.D and n, we use student's t-test. 


Null Hypothesis Ho : the advertising campaign was not 
successful. 


Alternative hypothesis H,:u>146.3 (Right tailed test) 


X—Hu 
5:4 
NL ES 
153.y—146.3 
iF 


422 i 
= 1.97 
Calculated value of t-test = 1.97 





The test statistic t = 





Tabulated value of t at 5% level of significance with 21 
degrees of freedom for right tailed test is 1.72. 
Calculated value of t > tabulated value of t 
We reject null hypothesis 
Advertising campaign was successful 
Illustration 
A random sample of size 26 values from a normal 
population showed a mean of 53 and a sum of Squares of 
deviation from the mean equals to 150. Can this sample be 
regarded as taken from the population having 56 as mean? Obtain 
95% confidence limits of the mean of the population. 
Given, n=sample size = 16 
x = sample mean — 53 
Six — x}* = 150 


Null hypothesis Ho : The sample is taken from the population 
having 56 as mean 
(1e) Ho z u=56 
PASA i 


Alternative hypothesis H,:41756. 


The test statistic £ = paid 
Jn 
33—56 
= — = —3.79 
Ji8 
iti = 3.79 


Calculated value of t-test = 3.79 
Degrees of freedom = n-1=16-1=15 
The tabulated value off t at 5% level of significance for 
15 degrees of freedom for two tailed test 1s 2.13. 
Calculated value of t-test > Tabulated value of t-test. 
We reject the null hypothesis Ho 
(i.e) the sample cannot be regarded as taken from the population 


The 95% confidence limit of the mean of the population is 
given by X + t9. o5 = 


vic 


= 53 + 2.13x Fe 





£ 
Wit 





= 53 + 2.132 


= 53+1.6827 

= (53+1.6827, 53-1.6827) 

= 54.68 and 51.31 
Hence 95% confidence limit 1s [54.68, 51.31] 
Students t-test (when S.D. of the samples is not given directly) 
Iilustration 

A random sample of 10 boys had the following I1.Q.’8 

70,120,110,101,88,83,95,98,107,100. Do these data support the 
assumption of a population mean I.Q of 100? Find the reasonable 
range in which most of the mean I.Q values of samples of 10 


boys lie? 
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Solution 
Here S.D. and mean of the sample are not given directly. 


We have to determine these S.D. and mean as follows. 


[98 


96.04 
7.84 














1833.650 
o 


— 203.73 
S? = 203.73 


S = 4203.73 


— 14.27 


Null Hypothesis Ho : The data support the assumption of a 
population mean 1.Q of 100 in the population. Ge. ) Ho:pn—100 
Alternative hypothesis H, : 1100. 


D ip 
The test statistic t = —z 
JA 
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97.2—100 
EE 14.3 F 
wid 
— 2.8 
/^— 4.514 


— -0.62 
lt] = 0.62 








Calculated value of t = 0.62 
Degrees of freedom = n-1=10-1=9. 


Tabulated value of t for 9 degrees of freedom at 5% level of 
significance 1s 2.26 (Two tailed test). 
Calculate value of t-test < tabulated value of t 
We accept the null hypothesis 1s Ho. 
The data support the assumption of 
Meem I.Q of 100 in the population. 
The 95% confidence limits are given b. 





z + to.os 7 = 97.2 +2.26 x AR 
= 97.242.26 x 4.514 


= 97.2 + 10.20 
= 97 2 + 10.20, 97.2-10.2 
— 107.41 and 86.99 
The 95% confidence limits within which the mean I.Q 
values of samples of 10 boys will be (86.99, 107.41). 
IHustration 
The height of 10 males of given locality are found to be 
70,67,62,68,61,68,70,64,64,66 1nches. Is it reasonable to believe 
that the average height 1s greater than 64 inches? Test at 5% level 


assuming that for 9 degrees of freedom P(t>1.83)=0.05S. 
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Solution 


Calculation for sample mean and SD 


x C 


a 
| 
N 
| 
a: 
| 
e 
e 
^ 
| 
3, 


AIIAJN ON ON[O 


a 
| 

= 

e 
T 


mà N ba 


x=660 = 90 


Null Hypothesis Ho 
The average height is not greater than 64 inches. 
Ge) Ho:n—64 inches. 
Alternative hypothesis H, : >64 inches 





Test statistic t = —g 








Calculated value of t—2 
Degrees of freedom = n-1 
= 10-1=9 
Tabulated value of t for 9 degrees of freedom at 5% level 
of significance for single tailed test 1s 1.83. 
Calculated value of t>Tabulated value of t 


We reject the null hypothesis 1s Ho 
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Student's t-test for difference of means 


To test the significant difference between two means x4 


and x4 of samples of sizes n, and nz, use the statistic. 


£l Xam 
gx 
^w oT Fe 
When sS? — TA m Tre ay" 
= = 
My Sy Ft gS 
(or) S? = 471 37— 


714 --14 —2 
Sı, S2 are sample standard deviation, this case, degrees of 
freedom = n,--n5—2. 
Illustration 
Samples of two types of electric lights bulbs were tested 
for length of life and following data were obtained 


Type 1 Type 2 
Samples Number n,=8 No=7 
Samples mean X4 = 1234 hrs Xa = 1036 hrs 
Sample SD S; = 30 hrs S5 — 40 hrs 


Is the difference in the mean sufficient to warrant that type 


I is superior to type II regarding length of life. 


Solution :- 
Given, n,;=8 Nno=7 
X, = 1234 x, = 1036 
S,—736 S2=40 
g2 — 725i+ 7253 
tatin Z 
|. 8C36)7-7C40)7 
8--7—2 
— 1659.08 
S  —4/1658.08 
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Null Hypothesis Ho : The two types I and H of electric bulbs are | 


1dentical 
Alternative hypothesis H; : >> 


The test statistic t = 


1234—1036 








Is a 
VEGSS.08 x jats 


= 9.39 
Calculated value of t = 9.39 
Degrees of freedom = n,+n>5-2 
= 8+7-2 
= 13 
Tabulated value of t for 13 degrees of freedom at 5% level 
of significance (one tailed test) is 1.77. 
Calculated value of t>tabulated value of t 
We reject the null hypothesis HO 
G.e) The two types I and II of electric bulbs are not 
identical. 
Hilustration 
The means of two random samples of size 9 and 7 are 
196.42 and 198.82 respectively. The sum of the squares of the 
deviation from the mean are 26.94 and 18.73 respectively can the 


sample be considered to have been drawn from the same normal 


population. 
Solution 
Given n,—9 n52—7 
X4, = 196.42 X; = 198.82 
Sx, — X;)^ = 26.94 slir — x;)" = 18.73 
_ ear — X33 xa 7 X327 


Ti4-F7E04 5 


G2 
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96.94412.73 
m 647-2 © 


S* = 3.26 
S=1.81 
Null Hypothesis Ho 
The two samples are drawn from the same population. 
Ge) HO: pi= p2 
Alternative hypothesis Hy: 4,7 pe 


3.26 





Wl—IYI 
The test statistic £ = —;———— 
i a 
E 
|| 1956.42—198.82 
1.91 re 
^ g 7 
—2.4 
= = — 32.63 
0.912 
t| = 2.63 


Calculated value of t=2.6 

Degrees of freedom = n;+n2-2 
= 9+7-2 
= 14 


Tabulated value of t for 14 degrees of freedom at 5% level 
of significance is 

2.15. 

Calculated value of t> tabulated value of t. 

We reject null hypothesis Ho. 

Ge) The two samples are not drawn from the same 
population 
Hliustration 


The nicotine content in milligrams of two samples of 


tobacco were found to be as follows. 


Sample | z | 23 [| 26 | a2 |35 | — 
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Can it be said that two samples come from normal population 


having the same mean. 


Solution 


Calculation for sample mean and S.D’s 





{x — x)" = 21.2 

Ax — x,}* = 108 

2 __ Hx,- 3a) -(x4—x4)? 
We tiEng_n 


_ £14.2+108 
5+6—2 


S 


= 14.35 
S? = 14.35 


S = 414.35 = 3.78 
Null hypothesis H, 
The two samples have been drawn from the normal 
population with the same mean 
Ge) Ho: 712 
Alternative hypothesis Ho : n,7ps 


X—X 
1 i 

Sni az 
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The test statistic t = 


3.78 | ats 
= 1.92 
jt] = 1.92 


Calculated value of t = 1.92 
Degrees of freedom = n,+n>-2 
= 5+6-2=9 


labulated value of t for 9 degrees of 5% level of 


significance is 2.262. 
Calculated value of t<Tabulated value of t 


We accept the null hypothesis Ho 
Ge) The samples come from normal population with the 
Same mean. 


F-Test 
lo test whether if there is any significant difference 


between two estimates of population variance (Or) 

lo test if the two samples have come from the same 
population, we use F-Test. 

In this case, we set up null hypothesis. 


Hy: of = o% (i.e) Population variances are same 


Under Ho, the test statistic is 


SS . : 2. 2 
Where s2 = 227s)" 
Tt. — d 
oz AY x.—3X2 = 
Tin —İ 


The degrees of Freedom A,—nQ0;—17vy-—naj4—1 


(or) 


si. . 
F = — if if S? > Sz 
z 
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Illustration 

In one sample of 8 observation the sum of the squares of 
deviation o the sample values from the sample mean was 84.4 
and in the other sample of 10 obervations it was 102.6 Test 


whether this difference is significant at 5% level. 


Solution 

Here n,=8 n5—10 

Jx — X,)” = 84.4 Ax, — X;)? = 102.6 
ER 2 

SQUE 2 = 12.087 
"e 

2 Ax- F? 102.6 — 
SS e PE 


Null hypothesis Ho 

o? = of Population variances are same. 
Alternative hypothesis Hı = o£ + o3 
The test statistic 


Calculated value of F=1.067 
Degrees of freedom, YQ; = na — 1=8-1=7 
F2 = MNz—1,—-10-1=-9 

Tabulated value of F at 5% level of significance for (7,9) 
degrees of freedom is 3.29. 

Fo os (7.9) = 3.29 

Calculated value of F < Tabulated value of F 

We accept the null hypothesis Ho 


There is no significant difference between population 


variances. 
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Illustration 

It is known that the mean diameter of rivets produced by 
two firms A and B are practically the same but the standard 
deviation differ. For 22 riverts. Produced by firm A, the standard 
deviation is 2.99 mm and for 16 rivers produced by firm B and 
standard deviation is 3.8mm. Copute the statistic you would use 
to test whether the products of firm A have same variability at 
those of firm B and test its significance. 
Solution 

Given n,—22 n5—16 


$S,—2.9 $2=3.8 


Here the S.D.’s of the samples S1 and S2 are given 
The population variance Sf and SZ and obtained by 
using the relations 
11,57 = (n,— 1s? 


22x(2.9)? 


?E4 — 1 
= 8.805 
n54Sj = (n5 — DSF 


162(3.9)* 
/^—  16—1 


— 15.393 
Null Hypothesis Ho : o2 = c2 
G.e) The product of both the firms A and B have the same 
variability. 
Since SZ > SZ, 
The best statistic 


53 
si 


Lnd 
rdi 
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= 1.74 
Calculated value of F=1.74 
Degrees of freedom = (r,,7) 
E (11. p T4 1) 
Tabulated value of F with (15,21) degrees of freedom at 5% level 
of significance is 2.20. 
Calculated value of F< Tabulated value of F 

We accept the null hypothesis Ho 

Ge) the products of both the firm A and B have the same 
variability. 
Illustration 


Ihe time taken by workers in performing a job by method 


I and Method II is given below. 





Do the data show that the variances of time distribution from 
population from which these samples are drawn do not differ 
significantly? 

Solution 


Calculation of sample variances 








Aix, —x,)^ = 81.34; Mx — x;)? = 133.72 
çZ — 1 Fy" 








= 22.29 


Null Hypothesis Ho : There is no significant difference between 
population variances. 


(ie) Hy: o2 = o2 
Alternative hypothesis H,: œ + o2 


SinceS7 > SZ, the test statistic 


Calculated value of F=1.37 
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Degrees of freedom = (n4, — L.n — 1) 
— (7-1, 6-1) 
= (6,5) 
Tabulated value of F for (6,5) degrees of freedom at 5% level of 
significance is 4.95. 
Calculated value of F test « Tabulated value 
We accept the null hypothesis Ho 
Ge) There is no significant difference between the variances of 
the time distribution by the workers. 
IHustration 


The nicotine contents in milligrams in two samples of 


tobacco were found to be as follows. 





Can it be said that two samples come from same normal 
population. 
Solution 

To test whether the two samples come from the same 
normal population we have to test 

G) The equality of variance by using F-Test 

a1) The eguality of means by using t-test 


Given n,=5, no=6 
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Calculations for mean’s and S.D’s of the samples 





2 — PR m E 
$1 714 — 1 4 
Ix 108 
X; = — = — = 21.6 
x 2 
xc Xo- 108 
5 = Shy)" = —= 91.6 
Thy —t 3 


(1) F-Test 
Null Hypothesis H4: o? = of 
(1e) The population variances are equal 


If S2 > S£, the test statistic 


Calculated value of F = 4.07 
Degrees of freedom = (nz2-1, n,-1) 
= (5.4) 
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labulated value of F for (5,4) degrees of freedom at 5% 
level of significance is 6.26 calculated value of F< Tabulated 
value of F. 

We accept the null hypothesis Ho 

Population variances are equal 
(11) t-test, 

Refer illustration 
CHI-SQUARE TEST OF GOODNESS OF FIT 

Suppose we are given a set of observed frequencies 
obtained under some experiment and we want to test if the 
experimental results support a particulars theory or hypothesis. 
Karal pearson developed a test for testing the significance of 
discrepancy between experimental values and the theoretical 
values obtained under some theory or hypothesis. This test is 
known as ° test of goodness of fit. Karl Pearson proved that the 


Statistic. 


ye Ny o EY 


Where O-Observed frequency 
E — Expected frequency 
* is used to test whether differences between observed 

and expected frequencies are significant. 

If the data is given in a series of ‘n’ number then degrees 
of freedom - n-1 

In the case of Binomial distribution d.f — n-1 

In the case of Poisson distribution d.f—n-2 

In the case of Normal distribution d.f=n-3 
Illustration 

The number of automobile accidents per week in a certain 
community are as follows 12,8,20,2,14,10,15,6,9,4 Are these 
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frequencies in agreement with the belief that accident conditions 
were the same during this 10 week period. 


Solution 


Expected frequencies of accident 
12+8420424144+1071546+ 9+4 


Each week = 
10 


160 
2 _ A 10 
10 


Null hypothesis Ho : The accident conditions were the same 
during the 10 week period. 
Observed Expected (O-EY 
— — E 


f 


(O—E3? 
NEU 


V^ Test = 
E 


— 26.6 
Calculated value of  ^-Test = 26.6 
Degrees of freedom = n-1 
= 10-1 
—9 
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Tabulated value of ° Test for 9 degrees of freedom at 5% 
level of significance is 16.9. 

Calculated value of ^ test > Tabulated value of  ^—test 

We reject the null hypothesis Ho 
(ie) The accident conditions were not the same during the 10 
week period. 
Illustration 

The theory predicts the proportion of beans in four groups 
A,B,C and D should be 9:3:3:1. In an experiment with 1600 
beans the numbers in the four groups were 382, 313, 287 and 
118. Does the experimental result support the theory. ` 
Solution: If we divide (382+313+287+118~1600) in the ratio 
9:3:3:1 we get the expected frequencies are 900, 300, 300, 100. 


Total 1606 





5*3*341 ae 100 
9 Parts = 900 
3 Parts = 300 
3 Parts = 300 
l Part = 100 


Observed Expected 
frequency Frequency 





255 


(Oo —E)* 
E 


— 26.6 
Calculated value of 7-Test = 4.726 


The test statistic Test = X 


Degree of freedom = n-1 = 4-1 =3 

Tabulated value of ^ Test for 3 degrees of freedom at 5% 
level of significance 1s 7.81. 

Calculated value of  ?-test < Tabulated value of  ^-test 

We accept the null hypothesis go. 

To the experiment support the theory 
CHI SQUARE TEST FOR INDEPENDENCE OF 
ATTRIBUTES 
Definition : Literally, an attribute means a quality or 
characteristic. Example of attributes are drinking, smoking, 
blindness, honesty etc., 

An attribute may be marked by its presence (position) or 
absence in a number of a given population. 

Let us consider two attributes A and B. A is divided into 
two classes and B is divided into two classes. The various cell 
frequencies can be expected in the following table known as 2x2 


contingency table. 





The expected frequencies are given by 
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adc ia T c)ta + by b ( + d)(a +b) 
[OE ro CED o 
(a 3 c) (a + c)Ca + d) bd) (5r d)c4b5 
[oom | 0 Deme 
henua s 


Null hypothesis Ho : Attributes are independent 













Degrees of freedom = (1-1) (s-1) 
Where r = number of rows 
S = number of columns 
Illustration 
The following table gives the classification of 100 workers 
according to sex and nature of work. Test whether the nature of 


work is independent of the sex of the workers. 





Males [Stable [Unstable [| Tom — 
mais | — 4 | 20 | 6 — 
Females | — 10 | 30 | o — 


Solution 







Null hypothesis Ho : Attributes are independent 


(i) The nature of work is independent of the sex of the 


workers. 


Expected frequencies are given in the table 
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2 


Calculation of 





(O—E)* 
2, E 


YW Test = 
= 16.666 
Calculated value of ^- Test = 16.666 
Degrees of freedom - (r-1) (S-1) 
= (2-1) 2-1) 
= ]x1-1 
Tabulated of  ^-test for 1 degree of freedom at 5% level of 
significance 1s 3.84 
Calculated value of 7>tabulated value of ^ 
We reject the null hypothesis Ho 
Attributes are not independent 
(1e) The nature of work 1s not independent of the sex of the 
workers. 


Illustration 


In a certain sample of 2000 families 1400 families are 
consumers of tea. Out of 1800 Hindu families, 1236 families are 
consumers of tea. Use  ^-test and state whether there is any 
significant difference between consumption of tea among Hindu 
and non-Hindu families. 


Solution: By Data, 
258 





Null hypothesis Ho : There is no significant difference between 
the consumption of tea among Hindu and Non Hindu families the 


expected frequencies are 


| Hindu |  Non-Hindu | Total 


mi 1800x1400 200x400 
tea 2000 2000 


Not — 1800x600 200x600 
consuming — —————— = 540 et esr) 
tea 2000 2000 
















—F 
UF Post = y ? 


E 
— 15.238 
Calculated value of 7 = 15.238 
Degrees of freedom = (1-1) (s-1) 
= (2-1) (2-1) 
=] 
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Tabulated value of  ?-test for 1 degree of freedom at 5% 
level of significance is 3.841. 
Calculated value of ^» Tabulated value of 
We reject the null hypothesis Flo 
(ie) Both Hindu and Non-Hindu families differ significantly as 
regard the consumption of tea among them. 
SUMMARY | 
This chapter summarizes the t-test, F-test, Chi-square test are 
commonly used in business research. 
KEYWORDS 
Null Hypothesis 
Level of significance 
Degrees of freedom 
Table values 
Sample test value 
CHECK YOUR PROGRESS QUESTIONS 

1. A machine which produces mica insulating washers for 
use in elastic devices is set to turn out washers having a 
thickness of 10mm. A sample of 10 washers has an 
average thickness 9.522mm with a S.D. of 0.6mm 
calculate student’s t-test. 

2. A random sample of size 20 from a normal population 
gives a sample mean of 42 and sample S.D 6-Test the 
hypothesis that the population mean 1s 44. 

3. The average number of articles produced by two machines 
per day is 200 and 250 with S.D. 20 and 25 respectively 
on the basis of regards of 25 days production. Can you 
regard both the machines equally efficient of 1% level of 
significance’? 
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4. Memory capacity of 10 students was tested before and 


after training state whether the training was effective or 


not from the following scores. 


training 





After 
training 


5. In one sample of 10 observations, the sum of the squares 
of the deviation o the sample values from the sample mean 
was 120 and in the other sample of 12 observations it was 
314. Test whether the difference is significant at 5% level. 


Two samples are drawn from the normal populations are 
gere o De oo [po [n n | [o [|] 1— 


7. A die 1s thrown 120 times and the frequencies of various 





face are as follows 


Fano | 1 | 2 |] 3 | * |] 5 | 6 
Frequeney[ 10 | 1s | 25 | 25 | i$ | 27 


Test whether the die was fair 









8. In a set of random numbers, the digits 0,1,2....9, were 


found to have the following frequencies. 





Cr pese 


Test whether they are significantly different from those expected 


on the hypothesis of uniform distribution. 
9. 1000 students at college level were graded according to 


their I.Q and the economic conditions of their homes use 
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? test to find out whether there is any association 


between economic conditions at home and IL.Q. 





10. A machine puts out 16 in perfect articles in a sample 500 


11. 


12. 


13. 


14. 


after machine is over hauled; it pulls out 3 imperfect 
articles in a batch of 100. Has the machine improved? 

A machine produced 20 defective articles in a batch of 
AOO. After over hauling it produced 10 defectives in a 
batch of 300. Has the machine improved? 

In a random sample of 400 persons from a large 120 are 
females can it said that males and females are in the ratio 
5:3 in the population use 1% level of significance. 

4 coin is tossed 900 times and head appears 490 times. 
Does this result support the hypothesis that the coin is 
unbiased. 

A wholesale apples claim 4% of the apples supplied by 
him are defective a random sample of 600 apples 
contained 36 defective apples. Test the claim of the 


wholesaler. 
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Q. 
IN 


Denominator Degreas of Freedom 


tj Loud 
Syg én PEDD EERE Selah BUNS wes V blu 


141.45 
18.513 
10.128 


7.2086 


6,6070 
5.9874 
5.53914 
5.3177 
5.1174 


4.9646 
4,8443 
4.7472 
4,6572 
4.6001 


4.54231 
A, A940 
4.4513 
4.4139 
4.3807 


4.3512 
4.3248 
4.3009 
4.2793 
4.2597 


4.2417 
4.2252 
4.2100 
4.1960 
4.1830 


4.1709 
4.0847 
4.0012 
3.9201 
3.8415 





199.50 
19.000 


9.5521 
9.9443 


5.7861 
5.1433 
4.7374 
4 A590 
$2005 


4.1023 
3.9823 
3.8853 
3.8056 
3.2389 


3.6823 
3.6337 
3.5915 
3.5546 
3.5219 


3.4928 
3.4668 
34434 
3.4221 
3 4028 


3.3852 
3.3690 
3.3541 
3.3404 
3,3277 


3.3158 
3.2317 
3.1504 
S.O7 ER 
2.9957 


215,71 
19.164 


35.7083 
3.5874 
3.4993 
3.4105 
3.3439 


3.2874 
3.2389 
3.1968 
3.1599 
3.1274 


3.0084 
3.0725 
340491 
3.0280 
35.0088 


29912 
259752 
2 9604 
2,0467 
2,9340 


2.39223 
2.5387 
2.7581 
2,5802 
2.6049 
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224.58 
19.247 


9.1172 
6.5882 


5.1922 
4.53537 
4,1203 
3.4379 
3,6331 


3,4780 
3.3567 
3.2592 
3.1791 
3.5122 


2141 
2.7014 


2.6896 
2,6060 
2.5252 
24472 
2.3719 


230.16 
19.296 


9.0835 
6.2561 


5,0503 
4.3874 
3.9715 
3.6875 
3.4817 


3.3258 
3.2039 
3.18059 
3.0254 
2.9582 


£9013 
2.8524 
2.8 FEM 
2,7229 
2,7401 


2.7109 
2.6848 
2.6613 
z,6400 
26207 


2.6030 
2.5868 
2.5719 
2.5581 
2.5454 


2.5530 
2 A495 
2.3683 


2.2899 | 


2.2141 


6 ^ 


233 99 


19.330 
8.9406 
5.163t 


4.9503 
4.2839 
3.8660 
3.5806 
3.3738 


3.2172 
30946 
29961 
2.9153 
2458417 


2,7905 
2,7413 
2 45998 
2.6613 
2.6283 


2 5990 
2.5727 
2.549] 
2.5277 
2.5082 


2,4904 
24741 
2.4591 
2.4453 
2,4324 


2.4205 
2.3389 
2.2541 
2.1750 
2,0986 


F - Distribution (Q = 0.05 in the Right Tail) 


Numerator Degrees of Freedom 
5 


236.77 


19.353 


238,88 
19,371 


8.8452 
6.0410 


4.3183 
4.1463 
3.7357 
3.4381 
3.2296 


3,0717 
2.9480 
2.84586 
2.7669 
2.6987 


2.64408 
2.5981 
2.54430 
2.3102 
2.4768 


24471 
24205 
2.3965 
2.3748 
2.3551 


2.3371 
2.3205 
2,9453 
2.2913 
2.2783 


2.2662 
2.1802 
2.40970 
2.0164 
1.9384 


240.54 
19.385 


& 8123 
6.9988 


4.7725 
4.0990 
3.6767 
3 3881 
3.1789 


3,0204 
2.8962 
2.7964 
2.7144 
Z 6458 


2.5816 


Denominator Degrees of Freadom 


ad 


40 


F- Distribution (œ= 0.05 in the Right Tail) - 


— Numerator pe of Freedom 





10 


241.88 
19.396 


8.7855 
5 964-4 


4.73351 
4.0600 
3.4385 
3.3472 
3.1373 


2.9782 
2.8536 
2.7534 
3.6710 
2.6022 


2.5437 
2.4935 
2.4499 
233117 
2.3779 


2.3479 
2.3210 
2.2967 
tarar 
22547 


22365 
2.2197 
2.2043 
2.1900 
2.1768 


2.1646 
20772 
1.9926 
1.9105 
1.8307 


243 91 
19.413 


8.7446 
5,9117 


4.6777 
3.9999 
3.5747 
3.2839 
3.0729 


2.9130 
2.7876 
2,6866 
2.6037 
2,5342 


24753 
24247 
2.3807 
2.3425 
2.3040 


22776 
22504 
2.225% 
222036 
2.9834 


2.1649 
2.1479 
2.1323 
2.1179 
2.1045 


2 0921 
2.0035 
[.9174 
E.8337 
1.7522 


245.95 
19.429 


4.7029 
5.8578 


4.6188 
3.9381 
3.5107 
3.2184 
3.0061 


2,8450 
2,7186 
2.6169 
2.3331 
24630 


2.4034 
2.3522 
2.3077 
J 2686 
2.2341 


2.2033 
2.1257 
2.1508 
2.1287 
2.1077 
208589 
2.0716 
2.0558 
2.0411 
2.0275 


240148 
1,9245 
1,8364 
1.2505 
1.6664 
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248 a 
19.446 


8.6602 
5.5025 


4.5581 


24905. 
19.454 


£.6385 
5.7744 


4.5272 
3.8415 
3.4105 
31152 
z,9008 


2.7372 
2 090 
2.5055 
2 202 
2.3487 


2.2878 
2.2354 
2.1898 
2.1497 
2.1141 


20825 
240540 
240233 
2.00560 
t.98538 


1.9643 
1.9464 
1.9299 
1.9147 


250. ID 
19.462 


£6166 
8,7459 


4 A957 
3.8082 
3.3758 
3,9794. 
2.3637 


2,5906 
2.5705 
24663 
2.3803 
2.3082 


2,2468 


2.1938 
2.1477 
2.1071 
200712 


2,0391 
240102 
1.9842 
1.9605 
1.9390 


1,9192 
1.9010 
1.8842 
1.8687 
1.8545 


1.849 
i 7444 
§ 6492 
1.5343 
1.4591 


251 A4 
19,471 


8.5944 
5.7170 


4.4638 
3.7743 
3.3404 
3.0428 
2.8259 


2.6609 
2.5309 
2.4259 
2.3392 
22664 


2.2043 
2.1507 
2.1040 
20629 
2.0264 


1.9938 
1.9645 
1.9380 
1.9139 
1.8920 


1.8718 
E.8533 
i E361 
1.8203 
1348055 


1.7518 
1.5028 
1.5943 
1.4952 
5.3940 


252.20 
19.479 


8.9720 
5.6877 


4,4314 
3,7398 
3,3043 
3.0053 
223872 


z62EH 
2.4901 
2.3842 
22966 
2.4229 


2.1601 
2.1058 
2,0584 
2.0166 
1.9795 


1.9464 
1.9165 
1.8894 
1.8648 
1.8424 


1.8217 
3.8027 
1.7851 
1.7689 
1.7537 


1.7396 
1.6373 
1.5343 
1.4290 
1.3189 


253.25 
19.487 


8.5404 
S.6381 


4.3985 
3.7047 
3.2674 
45669 
2.7475 


2.5801 
241420 
23410 
24524 
2,1778 


2.4 ES] 
20559 
20107 
1.966 1 
1.9302 


1.8963 
1.8657 
1.8380 
1.8128 
1.7896 


1.7684 
1.7488 
1.7306 
1.7138 
1.6981 


1.6835 
1.5766 
1.4673 
1.3519 
12214 


254.31 
19.496 


8.5264 | 
5.6281 | 


434550 | 


3,6689 


3,2298 | 


2,9276 
2.7067 


2.5379 


24045 | 


2962 | 
2.2064 | 
2.1307 | 
20658 | 
20096 | 


EIG | 
T.Si6s | 


1.8780 | 


1.8432 
1.8117 
1.7831 
1.7578 
1.73360 


1.7118 
1.6906 
1.6717 
1.6541 
1.6376 


1.6223 
1.5089 
1,3895 
1.2239 
$0000 


Denominator Degrees of Freedom 


h 


w 


3.9491 
3.7816 
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5624.6 
99.249 
28.110 
15.977 


11.392 


59.1483 
7.8466 
T.0061 
6.4221 


5.9943 
5.6683 


3859.0 
99,333 
27 531i 
15.207 


10.672 


8.4661 
7.1914 
6.3707 
5,8018 


5.9858 
5.0692 
43.8206 
4.6204 
4.4558 


4.3183 
4.2016 
43015 
4.0146 
3.9386 


3.8714 
3.8417 
3.2583 
3.7102 
3.6567 


3.6272 
3.5911 
3.5580 
15276 
3.4995 


3.4735 
3.2910 
3.1187 
2.9559 
2.8020 


5928.4 
99.355 
IPSI 
14.976 


10.456 


8.2600 
£$.9928 
56.1776 
3.6129 
3.2001 
4.8861 
4.6395 


5981.1 
99.374 
27.489 
14.799 


10.289 


8.1017 
6.8400 
6.0289 
3.4671 


540567 
4.7445 
4.4994 
4.3021 
4.1399 


4.0045 
3.8896 
3,7910 
3.7054 
345305 


3.5644 
3.5056 
344530 
3.057 
3.3629 


3.3239 
3.2884 
3.2558 
5,2259 
3.1982 


3.1726 
2.9930 
2.8233 
24629 
2.5453 





6022.5 
OF 388 
21.345 
14,639 


10.158 


7.9761 | 
6.7188 
5.9106 
8.351f 


4.9424 
4.5315 | 
4.3875 | 
44911 | 
4.0297 | 


3.8948 
3.7804 
3.6822 | 
3.5971 
3.5225 | 


3.4567 
3.3981 
3.3458 | 
2.2986 | 
3.2560 | 


3.2172 
3.1818 | 
3.1494 | 
34195 | 
3.0920 | 
3.0665 | 
2.8876 | 
2.7185 

2.5586 
2.4073 | 


Denominator Degrees di Fiada m 





6055.8 


95.399 
27,229 
14.546 


10,0381 
TRTA 
0201F 
5.8143 
5.2565 


4.8491 
4.5393 
4.2961 
4.1003 
3.9394 


3.8049 
3.66909 
3,5931 
3.5082 
3.4338 


3.3682 
3.398 
32576 
3.2106 
3.1681 


3.1294 
3.094 1 
3.0618 
3.0320 
3.0045 


29791 
2.8065 
2,5318 
2.4721 
2.3209 


i2 


61063 


qu 416 
37.052 
14.374 


9.8883 
7.7183 
6.469] 
5.6667 
5.1114 


4.7059 
4.3974 
4.1553 


> 4.9603 


3.800) 


A562 
3.5527 
3,4552 
3.3706 
3.2963 


3.2311 
3.1730 
3.1209 
340740 
5.0316 


2.9931 
2.9572 
£9256 
2.8959 
28635 
28431 
2.6648 
2.4961 
2.3363 
2.1847 





15 


6187.3 


99.433 
26.872 
14.198 


9.7222 
7.5590 
6.3143 
5.5151 
4.9621 


4,5581 
3,2509 
340096 
3,8154 
3.6557 


3,5222 
3,5089 
3,3117 
32273 
3.1333 


3.0880 
340300 
235779 
235311 
2.8887 


2.8502 
2.8150 
2.7827 
2.7530 
2.7256 


2,2002 
2.5216 
2.3523 
2,1915 


20385 


266 





6208.7 


20 


75469 
26.690 
14.020 


9.5526 
7.3958 
6.1554 
5.359] 
4.8080 


4.4054 
4.099) 
3.8584 
3.6646 
3.9052 
3.3719 
3.2587 
3.1615 
34771 
3.0031 


29377 
2.8796 
28274 
2.7805 
2.7380 


2.8993 
26640 
26316 
2.6017 
2.5742 


2.5487 
2.3689 
2.1978 
2.0346 
} 8783 





24 


6234.6 


99 453 
26.598 
È 3.929 


9,4565 
7.34121 
&.0743 
82793 
4.7290 


4.3269 
4.0209 
3.7805 
3.5868 
3,8214 
3.2940 
3_F BOS 
3.0835 
2.9990 
29249 


28554 
2.2010 
2.7488 
2 T7 
2.6391 


2.6203 
2.5848 
21.5322 
2,5221 
2.5946 


2.4689 
2.2880 
2.1 154 
1.95060 
1.7908 








6260.6 
99 466 


26.505 
13.838 


9.3793 
7.2285 
5.9920 
5.1981 
4.6488 


4,2469 
3,9411 
3.7008 
3.5070 
3.3476 


3,2141 
3,1007 
3.0032 
2.5185 
2.8442 


2.7785 
2.7200 
2.6675 
2.62602 
2.5773 


2.5383 
2.5026 
2 4699 
2,4397 
24158 


2.3860 
2.2034 
20285 
1.8600 
1.6964 


A 


6286.8 


99 474 
26441 
13.745 


9.2912 
7.19832 
5.9084 
8.1156 
4.5666 


4.1653 
3,8596 
3.6192 
3.4253 
3.2656 


3.1319 
3.59182 
2,9205 
2,8354 
x 608 


245047 
2.63559 
25831 
2.5355 
2,4923 


2.4530 
24170 
2.3840 
2.3535 
2.4253 


2.2992 
2.1142 
1.9360 
1.7628 
1.5923 


F - Distribution («x = 0.01 in the Right Tail) 


Numerator Degrees of Freedom 


60 


6313.0 
99 422 


26.316 
13.652 


9.2020 
7 00867 
5.8236 
5.0316 
4.4831 


3,08 I9 
3.7761 
3,5355 
3,3413 
3.1813 


34871 

2.9330 
2.85348 
2.7493 
2742 


2465077 
2.5484 
2,4951 
LAAT I 
2.4035 


2.3637 
2.3273 
2.2938 
2.2629 
2.2344 


2.2079 
441184 
L.B363 
1.6557 
1.47560 


126 


6339.4 


9o 459] 
26.221 
13.558 





6365.9 

99 499 
26.125 
3.463 


59.0204 
6.89500 
5.56495 
4.8583 
4.3105 


3,9090 
3.6024 


Table of Chi-square statistics 


[af |P = 0.05 [P = 0.01 IP = 0.001 
[1 | 3.84 | 664 | 10.83 
|2 | 599 | 921 | 13.82 
[3 | 7.82 [1135 | 1627 
|4 | 9.49 | 13.28 | 1847 _ 
[s | 1107 | 15.09 | 20.52 
|6 | 12.59 | 1681 | 22.46 _ 
|7 | 14.07 | 1848 | 2432 | 
[8 | 15.51 | 20.09 | 26.13 | 
[9 | 1692 | 21.67 | 27.88 | 
|10| 18.31 | 2321 | 29.59 
[11| 19.68 | 24.73 | 31.26 | 
|12| 21.03 | 26.22 | 32.91 
13| 22.36 | 27.69 | 34.53 
|14| 23.69 | 29.14 | 36.12 
115| 25.00 | 30.58 | 37.70 
|16| 26.30 | 32.00 | 39.25 | 
|17| 27.59 | 33.41 | 40.79 
|18| 28.87 | 34.81 | 42.31 | 
119| 30.14 [| 36.19 | 43.82 | 
I20| 31.41 | 37.57 | 45.32 
I21| 32.67 | 38.93 | 46.80 
| 33.92 | 40.29 | 48.27 
| 35.17 | 41.64 | 49.73 
| 36.42 | 42.98 | 51.18 
| 37.65 | 44.31 | 52.62 
| 38.89 | 45.64 | 54.05 
| 40.11 | 46.96 | 55.48 
| 41.34 | 48.28 | 56.89 
| 42.56 | 49.59 | 58.30 
| 43.77 | 50.89 | 59.70 
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PERCENTAGE POINTS OF THE T DISTRIBUTION 


Tail Probabilities 

One Tail O.10 0.05 0.025 0.041 0.005 0.001 90.0005 
Two Tails 0.20 0.10 0.05 0.02 O. Oi 0.002 0.9001 
pet ee ene E a a —————— ————— ————————— E E 
D. i | 3.078 6.314 12.71 31.82 63.66 318.3 637 

a 2 | 1.886 2.920 4.303 6.965 9.925 22.330 31.6 
"e 3 | 1.638 2.353 3.182 4.541 5.841 10.210 12.92 
"s 4 | 1.533 2.2132 2.776 3.747 4.604 7.173 8.610 
pe 5 | 1.476 2.015 2.571 3.365 4.032 5.893 6.869 
k^ 6 | 1.440 1.943 2.447 3.143 3.707 5.208 5.959 
E 7 | 1.415 1.895 2.365 2.998 3.499 4.785 5.408 
E & | 1.397 1.860 2.306 2.896 3.355 4.501 5.041 
^ 9 | 1.383 1.833 2.262 2.821 3.250 4.297 4.781 
g^ 10 | 1.372 1.812 2.228 2.764 3.169 4.144 4.587 
ui iz | 1.363 1.796 2.201 2.718 3.106 4.025 4.437 
E" 12 | 1.356 1.782 2.2179 2.681 3.055 3.930 4.318 
p 13 | i.350 1.771 2.160 2.650 3.012 3.852 4.221 
p id | 1.345 1.7641 2.145 2.624 2.977 3.787 4.140 
E" 15 | 1.341 21.753 2.2431 2.602 2.947 3.733 4.073 
E 16 | 1.337 1.746 2.120 2.583 2.921 3.686 4.015 
E^ 17 | 1.333 1.740 2.2110 2.567 2.898 3.646 3.965 
E is | 1.330 1.734 2.101 2.552 2.878 3.610 3.922 
= 19 | 1.328 1.729 2.093 2.539 2.861 3.579 3.883 
ui 20 | 1.325 1.725 2.086 2.528 2.845 3. 552 3.850 
= 21 | 1.323 1.721 2.080 2.518 2.831 J: O27 3.819 
ES 22 | 1.321 1.717 2.074 2.508 2.819 3.505 3.792 
d 23 | 1.319 1.714 2.069 2.500 2.807 3.485 3.768 
us 24 | 1.318 1.7411 2.064 2.492 2.797 3.467 3.745 
= 25 | 1.316 1.708 2.060 2.485 2.787 3.450 3.725 
= 26 | 1.315 1.706 2.056 2.479 2.779 3.435 3.707 
d 27 | 1.314 1.703 2.052 2.473 2.7741 3.421 3.690 
27 28 | 1.3413 1.701 2.048 2.467 2.763 3.408 3.674 
di 29 | 1.311 1.699 2.045 2.462 2.756 3.3926 3.659 
» 30 | 1.310 1.697 2.042 2.457 2.750 3.385 3.646 
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